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Transcriptional Activity Assay 



Field of the invention 



5 The present invention relates to a novel method for determining the biological 

activity of compounds which can modulate gene transcription, and based on this method, 
a method for the determination of the relative biological activity of a compound and a 
method for the identification of compounds which modulate gene transcription. The 
methods of this invention are particularly useful for identifying compounds that may be of 

10 use in the treatment of diseases which depend on the modulation of gene transcription 
within affected cells and tissues. 



Background 

15 The biological activity of compounds that regulate cellular functions such as cell 

proliferation or viral protection is commonly determined by measuring increases or 
decreases of binding activities of these factors and/or compounds either to native or 
recombinant receptors in vitro, or in cell-based binding assays. A disadvantage of these 
assays for the evaluation of the biological activity of compounds resides in the fact that the 

20 binding affinity does not necessarily correlate with the biological activity of a compound. 
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A more direct and, therefore, more reliable determination of the biological activity of 
compounds can be achieved by well known assays such a cell proliferation , cell survival or 
cell migration assays. Such assays are, however, more cumbersome, especially for larger 
scale screenings. In many cases* compounds which regulate such cellular activities also 

5 modulate gene transcription. Thus, the biological activity of such compounds may be 
determined by measuring the modulation of gene transcription, which is commonly done 
by reporter assays, which necessitate the generation of recombinant cells expressing a 
suitable reporter construct. Alternatively, the biological activity of such compounds may 
be determined by measuring the expression of suitable target genes, eg. by Northern Blot 

10 analysis or quantitative PCR, which is, however not efficient. 



Description of the invention 



15 The problem of the present invention was to provide a reliable and easy method for 

the determination of the biological activity of a modulator of transcriptional activity which 
lacks the disadvantages of binding assays, but is more efficient than the commonly used 
activity assays. 



20 Surprisingly, it was found that the biological activity of a compound can be 

determined more reliably by using a novel method based on the determination of the 
general transcriptional gene response induced by said compound comprising quantitation 
of signal intensities of genes which are transcriptionally regulated by said compound. 



25 



30 



The present invention provides a novel method for determining the biological 
activity of a compound which can modulate gene transcription, comprising the steps of a) 
contacting a host with a compound; b) determining the general transcriptional gene 
response of the host; and c) quantitating the general transcriptional gene response induced 
by said compound: Preferably, the biological activity of said compound is given by the sum 
of the normalised average difference of signal intensities quantitated in step c). In a more 
preferred embodiment of the present invention the general transcriptional gene response 
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is determined by an DNA array technology. In an even more preferred embodiment, the 
array technology is an oligonucleotide array technology. Preferably the DNA or 
oligonucleotide arrays are high density DNA or oligonucleotide arrays. Such arrays, and 
methods of analysing DNA or RNA using such arrays have been described previously, eg. 
5 in EP0834575, EP0834576, W096/31622, US5837832 or WO98/30883. WO97/10365 

provides methods for monitoring of expression levels of a multiplicity of genes using high 
density oligonucleotide arrays. In a most preferred embodiment, the oligonucleotide 
arrays are GeneChip arrays. 

10 The term " general transcriptional gene response " as used herein refers to the 

biological activity of a compound given by the sum of the normalised average difference of 
signal intensities of genes regulated by said compound in a responsive host. The 
determination of the general transcriptional gene response is achieved by hybridizing 
labeled cDNA from the treated hosts with DNA or more preferably oligonucleotide arrays. 

15 The raw data of the arrays is then measured, eg. with a confocal laser scanner (Hewlett 
Packard, U.S.) as a non-limiting example, and pixel levels are analysed using commonly 
known methods. The expression level for each gene is calculated as a normalised average 
difference of intensity as compared to hybridisation to mismatched oligonucleotides, and 
expressed in arbitrary units (AD). Four criteria are applied to select differentially expressed 

20 genes: (1) The intensity value of treated cells was at least two fold higher than the intensity 
value of untreated cells. (2) The intensity value in the treated cells should be higher or at 
least 50 arbitrary units (AD). (3) The intensity value of the untreated cells should be lower 
than 50 AD. ( 4) The standard deviation has to be significantly lower than the absolute 
' change in average difference and the calculated confidence level of a gene is set to greater 

25 than 97% (p value < 0.03). The general transcriptional gene response of the host induced 
by a compound is then quantitated by obtaining the sum of the normalised average 
difference of signal intensities (sum of arbitrary units, Z AD). 

The non-limiting examples described hereinafter demonstrate that the results 
30 obtained with this novel method more reliably correlate with biological activity than do 
binding assays, with the additional advantage of being efficient. They also show that it 
determining the general transcriptional gene response by the method of the present 
invention is more reliable than the determination of the transcriptional expression levels 
of an individual gene or a small selection of genes. 
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The host used for the method of the present invention may be a whole non-human 
organism, which includes, but is not limited to a transgenic or a knockout animal; specific 
types of tissue from any species, which includes, but is not limited to, liver, heart , kidney, 

5 or primary cell isolates, such as blood cells or primary cells isolated from a tissue, or cell 
lines. The host or hosts used to determine the biological activity of a compound by the 
present invention has to be responsive to a compound of interest. The term "responsive" as 
used herein refers to the ability of a host to exhibit elevated or reduced transcription of 
genes in response to a compound. As a non-limiting example, a melanoma cell line ME 15 

10 (CNCM 1-2546 in WO02/18633) can be used to determine the biological activity of PEG- 
IFN or PEG-IFN isoforms by the method of the present invention. 

The major advantages of the present invention, as compared to the commonly used 
activity assays or methods for determining gene transcription such as reporter assays or 

15 Northern Blotting are that the method does not require the preparation of any cell lines 
transfected with reporter constructs, and it allows for an easy and efficient analysis of a 
multitude of genes that are transcriptionally regulated, thus rendering the activity assays 
more efficient and reliable. Furthermore, as shown in Figure 4, binding activities do not 
always correlate with biological activity, while the determination of the transcriptional 

20 activity of compounds based on the determination of the general transcriptional gene 

response does correlate. Thus, the method of the present invention allows a more accurate 
determination of the biological activity of compounds than do binding assays. 

In one embodiment, the compound of the method hereinbefore described is a 
25 modified compound, more preferably a modified protein, even more preferably a 

pegylated protein, or most preferably, a specific isolated isoform of a pegylated protein. 
Preferably, the pegylated protein is erythropoietin (EPO) or interferon (IFN). 

The term "PEG-IFN" as used herein includes IFN-as derived from any natural 
30 material (e.g., leukocytes, fibroblasts, lymphocytes) or material derived therefrom (e.g. cell 
lines), or those prepared with recombinant DNA technology. Details of the cloning of 
IFNcc and the direct expression thereof, especially in E. coli, have been the subject of many 
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publications. The preparation of recombinant IFNcxs is known, for example from Goeddel 
et al. (1980) Naturelte, 316-320 and (1981), Nature 290, 20-26, and European Patents 
Nos. 32134, 43980 and 21 1 148. There are many types of IFNa such as IFNcxI, IFNcc2; and 
further their subtypes including but not limited to IFN<x2A, IFNa2B, IFNoc2C and IFNocII 
5 (also designated IFNocII or o-IFN). The term "IFNcc" also includes consensus IFNa 
available from Amgen or mixtures of natural and/or recombinant IFNocs. The use of 
IFNCX2A is preferred. The manufacture of IFNcc2A is described in European Patents Nos. 
43980 and 211148. 

10 The IFN-cc is conjugated to a polymer such as a polyalkylene glycol (substituted or 

unsubstituted), for example, polyethylene glycol, to form PEG-IFN-cc conjugate. 
Conjugation may be accomplished by means of various linkers known in the art, in 
particular by linkers such as those disclosed in European Patent Applications, Publication 
Nos. 0510356, 0593868 and 0809996. The molecular weight of the polymer, which is 

15 preferably polyethylene glycol, may range from 300 to 70.000 daltons, and one or more, 
preferably one to three, polymers may be conjugated to the IFN-a . A preferred PEG-IFN- 
a conjugate has the formula: 



ROCH 2 CH 2 (OCH 2 CH 2 )n— O C — NH 



RX)CH 2 CH 2 (OCH 2 CH 2 )n^O C 

O 



NH 




C — X IFN-aIpha2A 

II 
O 



where R and R 1 are methyl, X is NH, and n and n' are individually or both either 420 



20 or 520. 



In another preferred embodiment of the method hereinbefore described, the 
transcriptionally regulated genes are one or more genes selected from the group consisting 
of Seq ID No. 1 to 29, which are also listed in Table 1. 
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The method of the present invention is suitable for the analysis of any compound 
that modulates gene transcription in any type of host. Modulation of gene transcription 
includes both upregulation or downregulation of gene transcription. 



5 The method hereinbefore described is the basis for a method for determining the 

relative biological activity of a modified compound, comprising measuring the biological 
activity of the modified compound and the respective unmodified compound by the 
method hereinbefore described, wherein the relative transcriptional activity of the 
modified protein is determined by calculating the ratio of the biological activity of the 

10 modified compound and the biological activity of the unmodified compound. Preferably, 
the modified compound is a chemically or enzymatically modifed compound. More 
preferably, the modified compound is a pegylated protein. Thus, this method provides an 
easy assay to compare the biological activity of modified compounds with their 
unmodified counterparts. In a preferred embodiment, the present invention allows to 

15 select the isoform of a compound with the highest biological activity isolated from a 
mixture of isoforms, most preferably from pegylated proteins. 



The compounds whose biological activity can be determined by the method of this 
invention include any compound that can modulate gene transcription. These compounds 

20 can be proteins, polypeptides, nucleic acids, polymers or small molecules. Many proteins, 
polypeptides, nucleic acids, polymers or small molecules can modulate gene transcription, 
either directly, by acting as transcription factors, or indirectly, by inducing signalling 
cascades that ultimately up- or downregulate gene transcription or alternatively, by 
interfering with other compounds that modulate gene transcription. The following list of 

25 compounds whose biological activity can be determined by the method of the present 
invention exemplifies in a non-limiting way the kinds of compounds, including proteins 
and polypeptides, for which the method of the present invention can be used. 



Among these non-limiting examples are hormones, such as retinoid acic, that induce 
30 nuclear hormone receptors, which are transcription factors (A. Rowe: Retinoid X 

receptors, Int. /. Biochem. Cell Biol, 29 (1997), 275-278; and CK. Glass: Some new twists in 
the regulation of gene expression by thyroid hormone and retinoic acid receptors, /. 
Endorinol, 150 (1996), 349-357). The method of the present invention may be used to 
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determine the biological activity of such hormones and their nuclear hormone receptors, 
and it may also be used to identify compounds that interfere with the transcriptional 
activity of such nuclear hormone receptors.Thus, the present invention may, for example, 
be useful for identifying and analysing inhibitors of nuclear hormone receptors such as 
5 RXR or LXR. 



Further to hormones that activate nuclear hormone receptors, the method of this 
invention may also be used to determine the biological activity of other hormones which 
activate signalling cascades that lead up to the modulation of gene transcription. As a non- 
10 limiting example, such hormones include erythropoietin (Epo), which is known to 

modulate gene transcription (Z.Z.Chong et al: Hematopoietic factor erythropoietin foster 
neuroprotection through novel signal transduction cascades, /. Cereb. Blood Flow Metab., 
22 (2002), 503-514; D.M. Wojchowski and T.C. He: Signal transduction in the 
erythropoietin receptor system, Stem. Cells, 11 (1993), 381-392; J.L. Spivakr.The 
15 mechanism of action of erythropoietin, Int. J. Cell Cloning, 4 (1986), 139-166; and L. 
Mulcahy: The erythropoietin receptor, Semin. Oncol 28 (2001), 19-23). 



Cytokines and growth factors are known to induce signalling cascades that lead to a 
modulation of gene transcription. Thus, as a non-limiting example, the method of the 

20 present invention may also be used to identify agonists or antagonists of cytokines and 
growth factors, and to determine their biological activity. Such cytokines and growth 
factors include, but are not limited to, Interferons, Tumor Necrosis Factor, Vascular 
Endothelial Growth Factor (T. Hanada and A. Yoshimura: Regulation of cytokine 
signaling and inflammation, Cytokine Growth Factor Rev., 13 (2002), 413-421; and Y. Sato 

25 et al.: Properties of two VEGF receptors, Flt-1 and KDR, in signal transduction, Ann. N Y. 
Acad. Sci, 902 (2000), 201-207). 



Further to this, the method of the present invention may be used to determine the 
biological activity of modified or mutated proteins or polypeptides. Modification may be 
30 achieved either chemically or enzymatically. A non-limiting example for this application is 
the determination of the biological activity of different isoforms of pegylated Interferon, 
which is shown in the Figures attached to this application. Thus, the present invention 
provides for determining the relative biological activity of a modified compound, 



comprising measuring the biological activity of the modified compound and the respective 
unmodified compound by the method hereinbefore described, wherein the relative 
transcriptional activity of the modified protein is determined by calculating the ratio of the 
biological activity of the modified compound and the biological activity of the unmodified 
compound. Preferably, the modified compound is a chemically or enzymatically modifed 
compound. Most preferably, the modified compound is a pegylated protein. 

The method of the present invention may also be used for identifying and 
characterizing modulators of enzymes which, as a downstream event, can cause a 
modulation of gene transcription. A non-limiting example for such enzymes are 
phosphodiesterases (PDE), preferably PDE4, most preferably PDE4D isoenzymes. The 
PDE4D isoenzymes are preferably selected from the group consisting of PDE4D1, 
PDE4D2, PDE4D3, PDE4D4, PDE4D5, PDE4D6, PDE4D7 and PDE4D8. PDEs degrade 
cyclic AMP, a second messenger which can modulate gene transcription (G.S. McKnight: 
Cyclic AMP second messenger systems, Curt. Opin. Cell Biol. , 3 (1991) 213-7), and potent 
PDE inhibitors were found to modulate gene transcription (B. Wagner et al.: 7- 
Benzylamino-6chloro-2piperazino-4-pyrrolidino-pteridine > a potent inhibitor of cAMP- 
specific phosphodiesterase, enhancing nuclear protein binding to the CRE sonsensus 
sequence in human tumour cells, Biochemcial Pharmacology* 63 (2002) 659-668; Y. Nishio 
et al.: Cilostazol, a cAMP phosphodiesterade inhibitor, attenuates the production of 
monocyte chemoattractant protein- 1 in response to tumor necrosis factor-alpha in 
vascular endothelial cells, Hormone & Metabolic Research* 29 ( 1997) 491-495). Thus, the 
method of the present invention maybe used for screening of modulators of such 
enzymes. 

Among the compounds that can be identified and analysed are also functional 
antibodies. Functional antibodies are antibodies that either inhibit or stimulate the 
function(s) of the protein that they bind to. This type of antibodies has been raised against 
numerous target proteins, and some of these antibodies are also of therapeutic value. 
Amongst these are, as a non-limiting example, anti-Her2/ErbB2 antibodies, also called 
Herceptin antibodies. These antibodies are being successfully used in the treatment of 
breast cancer. They function by inhibiting the activation of human epidermal growth 
factor receptor Her2/ErbB2, which plays a pivotal role in many types of cancer, including 
breast cancer. Like many other growth factor receptors, Her2, when bound by its ligand(s), 
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mediates signalling including modulation of gene transcription. It has been shown that 
Herceptin is capable of modulating gene transcription in cells that express Her2/ErbB2 
(A.H. Talukder et al.: Antihuman epidermal growth factor receptor 2 antibody herceptin 
inhibits autocrine motility factor (AMF) expression and potentiates antitumor effects of 
5 AMF inhibitors, Clin. Cancer Res.8 (2002), 3285-3289; and R. Mandler et al.: Modification 
in synthesis strategy improve the yield and efficacy of geldanamycin-herceptin 
immunoconjugates, Bioconjugate Chemxstry y 13 (2002), 786-791). Currently, eight different 
Herceptin antibodies exist, and more may still be produced. Generally, in the case of 
functional antibodies, the method of the present invention may be used in determining 
10 which antibody displays the highest biological activity and would be the most suited for 
therapeutic use. Thus, the method of the present invention is also suitable for determining 
the biological activity of antibodies, eg. for the identification of inhibitory or stimulating 
antibodies. In a preferred embodiment, the antibody is an antibody of therapeutic value. 
In a more preferred embodiment, the antibody is a Herceptin antibody. 

15 

For the identification and analysis of a compound that acts indirectly by inhibiting 
or stimulating the biological activity of a second compound, such as a hormone, cytokine, 
growth factor, enzyme etc., the present invention provides a method for identifying 
compounds that indirectly modulate gene transcription comprising determining the 

20 biological activity of a first compound which is a known modulator of gene transcription 
by the methods hereinbefore described in the presence or absence of a second compound, 
wherein a compound which indirectly modulates gene transcription is identified if the 
biological activity of the first compound measured in the presence of said second 
compound is significantly different from the biological activity of the first compound 

25 measured in the absence of said second compound. 

The present invention provides a compound identified by the methods hereinbefore 
described which modulates gene transcription. A use of the methods hereinbefore 
described for identifying a compound which modulates gene transcription is provided as 
30 well. The present invention also provides a kit comprising components for carrying out 
the methods hereinbefore described. Perferably, said components comprise a DNA array, 
more preferably an oligonucleotide array. 
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Furthermore, a pharmaceutical composition comprising a compound identified by 
the methods hereinbefore described and a pharmaceutical^ acceptable carrier is provided. 
Any conventional carrier material can be utilized. The carrier material can be an organic or 
inorganic one suitable for eteral, percutaneous or parenteral administration. Suitable 
carriers include water, gelatin, gum arabic, lactose, starch, magnesium stearate, talc, 
vegetable oils, polyalkylene-glycols, petroleum jelly and the like. Furthermore, the 
pharmaceutical preparations may contain other pharmaceutical^ active agents. Additional 
additives such as flavoring agents, stabilizers, emulsifying agents, buffers and the like may 
be added in accordance with accepted practices of pharmaceutical compounding. 



Additionally, the present invention provides a method for the production of a 
pharmaceutical composition comprising the steps of identifying a compound by the 
method of the present invention, modifying the identified compound and formulating the 
compound obtained with a pharmaceutical^ acceptable carrier or diluent. 



The present invention also provides a use of a compound identified by the methods 
hereinbefore described or a pharmaceutical^ acceptable salt thereof for the preparation of 
a medicament for the treatment of a disease, preferably of a viral disease or of a cancer. 



The term "pharmaceutically acceptable" is employed herein to refer to those 
compounds, materials, compositions, and/or dosage forms which are, within the scope of 
sound medical judgment, suitable for use in contact with the tissues of human beings and 
animals without excessive toxicity, irritation, allergic response, or other problem or 
complication, commensurate with a reasonable benefit/risk ratio. 



As used herein, "pharmaceutically acceptable salts" refer to derivatives of the 
identified agents wherein the parent agent is modified by making acid or base salts thereof. 
Examples of pharmaceutically acceptable salts include, but are not limited to, mineral or 
organic acid salts of basic residues such as amines; alkali or organic salts of acidic residues 
such as carboxylic acids; and the like. The pharmaceutically acceptable salts include the 
conventional non-toxic salts or the quaternary ammonium salts of the parent compound 
formed, for example, from non-toxic inorganic or organic acids. For example, such 
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conventional non-toxic salts include those derived from inorganic acids such as 
hydrochloric, hydrobromic, sulfuric, sulfamic, phosphoric, nitric and the like; and the salts 
prepared from organic acids such as acetic, propionic, succinic, glycolic, stearic, lactic, 
malic, tartaric, citric, ascorbic, pamoic, maleic, hydroxymaleic, phenylacetic, glutamic, 
5 benzoic, salicylic, sulfanilic, 2-acetoxybenzoic, fumaric, benzenesulfonic, toluenesulfonic, 
methanesulfonic, ethane disulfonic, oxalic, isethionic, and the like. 

Also claimed are the methods, compounds, kits and uses as hereinbefore described, 
especially with reference to the foregoing examples. 

10 

Having now generally described this invention, the same will become better 
understood by reference to the specific examples, which are included herein for purpose of 
illustration only and are not intended to be limiting unless otherwise specified, in 
connection with the following figures. 



Brief description of the figures: 
Figure 1: 

20 Analytical IEC-HPLC of 180^ig of PEG-IFN. An analytical strong-cation exchange column 
was used to separate the positional isomers (TOSOH-BIOSEP, SP-5PW, 10 \im particle 
size, 7.5 mm diameter and 7.5 cm length). 

Figure 2: 

25 (A) Specific antiviral activity of the 9 positional isomers and PEG-IFN 

The figure shows the specific antiviral activity [U/|ig] of the positional isomers. The 
antiviral activity was determined in MDBK cells infected with vesicular stomatitis virus. 

(B) Specific antiproliferative activity of the isomers and PEG-IFN 
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This figure indicates the specific inhibition of proliferation [OD 4 90nm/ng] on the ME15 
cells with 100 U/ml of the each positional isomer and PEG-IFN. 

The results present the averages of three assays performed independently. The specific 
antiviral activity and specific antiproliferative activity of PEG-IFN is given for comparison 
5 (hatched line). 

Figure 3: 

Time dependent inhibition of proliferation in a melanoma cell line (ME-15) The ME15 
cell line was cultured in presence of different concentrations (500,125, and 15 U/ml) IFN, 
10 PEG-IFN and isomers over 5 days. The cell proliferation was measured every 24h. The 
control cell density after 5 days was set as 100% as references. The data are derived from 
three independent culture replicates. 

Figure 4: 

15 Correlation of the kinetic dissociation constant and specific antiviral activity of the PEG- 
IFN and isomers with high (K134), intermediate (K131, K164) and low (K121) specific 
antiviral activity. 

Figure 5: 

20 (A) Correlation of specific antiviral activity, specific antiproliferative activity and specific 
transcriptional response of each isomer. The PEG-IFN was set as 100% value reference. 
The hatched line indicates the biological and transcriptional activity of PEG-IFN. The sum 
2 relative mRNA abundance (AD) is illustrated as the sum of all inducible genes for each 
isomer. 

25 (B) Hierarchical clustering of AD of the different general transcriptional gene response 
from 100 U/ml IFN, PEG-IFN and the isomers. The incubation time of the IFN and the 
pegylated variants was 48h; the cell line used is an IFN-sensitive melanoma cell line 
(ME15). The conditions for the data analysis are described in the text. The data of the 
different expression profiles of the genes are reported in Table 2. 
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Figure 6: 

3D-Structure based Model for PEG-IFN mediated interferon responses. Arrangement of 
the positional PEG-IFN isomers in 4 groups according to the specific antiviral and 
5 _ antiproliferation activity. The high resolution-Structure of human interferon alpha-2a was 
determined with NMR spectroscopy (Klaus et al.: The three-dimensional high resolution 
structure of human interferon alpha-2a determined by heteronuclear NMR spectroscopy 
in solution. /. Mol Biol 274 (1997); 661-675). The pegylation sites of interferon alpha-2a 
are colored red. 

10 

Examples 

Antiviral and antiproliferative roles of IFN 

15 Interferon alpha-2a (IFN) plays an important role in mediating antiviral and 

antiproliferative responses and in modulating immune responses (G.R. Stark: How cells 
respond to interferons, Annu. Rev. Biochem. 67 (1998), 227—264). These biological 
activities make IFN important as an inhibitor of disease pathogenesis and useful in the 
treatment of viral infection and some cancers (S. Zeuzem, et al. : Peginterferon alfa-2a in 

20 patients with chronic hepatitis C, New England Journal of Medicine 343 (2000), 1666- 
1172). Since 1986 recombinant human IFN (ROFERON-A) has become an important 
therapeutic agent to treat patients with viral and oncological diseases (S. Zeuzem et al.: 
Peginterferon alfa-2a (40 kDa) monotherapy: a novel agent for chronic hepatitis C 
therapy, Expert Opinion on Investigational Drugs 10 (2001), 2201-2213). 

25 

The success of IFN treatment in the clinic is partly limited due to its short serum 
half-life (4 - 6h). Peak concentrations of IFN occur 3-8 h following iv. or sc. 
administration and large fluctuations in serum concentrations can occur after each dose. 
After 24 h , the amount of exogenous IFN remaining in the body is negligible such that the 
30 thrice-weekly regimen used is unlikely to maintain concentrations required for antiviral 



- 14- 



activity (J.M. Harris et al.: Pegylation: A novel process for modifying pharmacokinetics, 
Clin. Pharmacokinet. 40 (2001), 539-551; P. Bailonet al.: Rational design of a potent, long- 
lasting form of interferon: A 40 kDa branched polyethylene glycol-conjugated Interferon- 
2a for the treatment of hepatitis C Bioconjugate Chem. 12 (2001), 195-202). 



Administration frequency of IFN is dependent on the disease. For example, the 
standard therapeutic application of IFN consists of 3 million international units 
administered subcutaneously three times per week for up to 48 weeks (J.M. Harris et al.: 
Pegylation: A novel process for modifying pharmacokinetics, Clin. Pharmacokinet. 40 
(2001), 539-551), while certain oncological indications may require daily administration 
(P. Bailon et al.: Rational design of a potent, long-lasting form of interferon: A 40 kDa 
branched polyethylene glycol-conjugated Interferon-2a for the treatment of hepatitis C, 
Bioconjugate Chem. 12 (2001), 195-202). In order to improve the pharmacokinetic and 
pharmacodynamic properties of IFN, it was conjugated with poly( ethylene glycol) (PEG). 

Since several years, studies have been carried out to assess the influence of the 
various PEG sizes and PEG types on in vitro and in vivo bioactivity. These results showed 
the important correlation of the PEG characteristics and the biological activity. For 
example, branched PEG moieties may more effectively cloak and protect IFN from 
proteolytic degradation than the linear PEG attachments and may also enhance the 
absorption and distribution properties of protein. Another effect is that increasing the 
molecular weight of the PEG moiety, decreases the clearance of the therapeutic molecule, 
thereby producing sustained exposure to the virus (J.M. Harris et al.: Pegylation: A novel 
process for modifying pharmacokinetics, Clin. Pharmacokinet. 40 (2001), 539-551). Based 
25 on this knowledge coupled with the latest advancements in pegylation procedures, a long- 
lasting, branched monopegylated IFN was developed (P. Bailon et al.: Rational design of a 
potent, long-lasting form of interferon: A 40 kDa branched polyethylene glycol-conjugated 
Interferon-2a for the treatment of hepatitis C, Bioconjugate Chem. 12 (2001), 195-202). 

30 An interferon alpha-2a conjugated via an amide linkage with a branched 40kDa PEG 

(PEG-IFN), exhibits sustained adsorption and reduced renal clearance, resulting in strong 
antiviral pressure throughout a once- weekly dosing schedule (M.C. Perry and B. Jarvis: 
Peginterferon-a-2a (40 kD): A review of its use in the management of chronic hepatitis C, 
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Drugs 15 (2001), 2263-2288; and M.W. Lamb and E.N. Martin: Weight-Based versus fixed 
dosing of peginterferon (40 kDa) alfa-2a, The Annals of Pharmacotherapy 36 (2002), 933- 
938). The characterization of this PEG-IFN mixture indicated 9 positional isomers which 
are monopegylated on lysines: K31, K49, K70, K 80 , Km, K121, Ki3i, K134 and Ki6 4 . All 
5 positional isomers induced viral protection of MDBK cells in the Anti- Viral Assay. 

Example 1: Preparation of PEG-IFN 

The PEG-IFN was manufactured by Hoffmann-La Roche Inc. The pegylated 
10 interferon alpha-2a was prepared by the conjugation of lysine e-amino groups at the 
surface of the interferon molecule with an activated branched polyethylene glycol 
derivative of molecular weight 40000 Da (Bailon, P. et al.: Rational design of a potent, 
long-lasting form of interferon: A 40 kDa branched polyethylene glycol- conjugated 
Interferon-2a for the treatment of hepatitis C. Bioconjugate Chem. 12 (2001); 195-202; 
15 EP809996). The isolation and characterization of the isomers of PEG-IFN is described 
below. All other reagents and chemicals were of the highest purity available. 

PEG-IFN consists of a mixture of monopegylated IFN molecules. In each molecule 
one single lysine is chemically modified by the attachment of a PEG moiety resulting in a 
20 mixture of several species. Due to the presence of eleven lysines and a free N-terminal 
amino group twelve monopegylated species are expected, but only nine lysine-modified 
species could be found (K31, K49, K70, K83, K112, K121, K131, K134 and K164). 

Example 2: Purification and characterization of positional PEG-IFN isomers 

25 

We developed a preparative two step chromatography method allowing efficient 
separation of these species. The purity of the prepared positional isomers could be checked 
by an analytical ion exchange method, see an example for an elution profile in Figure 1. 
Separation is based on local charge differences of the isomers resulting from the different 
30 pegylation sites on the protein moiety. The decrease of the baseline absorption towards the 
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end of the chromatogram suggests that there were no other monopegylated species of IFN 
eluting at higher retention time. By two step separation nine positional isomers were 
purified, the characterization of the isomers indicate the pegylation sides are all on lysines: 
K31, K49, K70, K83, K112, K121, K131, K134 and K164. 



Example 3: Separation of the pos itional isomers 

A two-step isolation and purification scheme was used to prepare the monopegylated 
isoforms of PEG-interferon alpha 2a. 

a) The first step was a separation of the positional isomers on a preparative low 
pressure liquid chromatography column with a weak-cation exchange matrix (TOSOH- 
BIOSEP.Toyopearl CM-650S, Resin Batch no. 82A having a ion exchange capacity of 123 
mEq/ml, the diameter of the column being 16 mm, the length 120 cm). A linear pH- 
gradient of increasing sodium acetate concentration (25 mM, pH 4.0 up 75 mM to pH 7.8) 
was applied at a flow rate of 0.7 mL/min. Detection was at 280 nm. With this 
chromatographic step species 1, 2, 5, 6 and a mixture of 3, 4, 4a, 7 and 8 could be collected, 
see Table 1. 

b) The fractions were further separated and purified in the second preparation step. 
A preparative column with the same matrix as the analytical strong-cation exchange 
column as described above but larger dimensions (30 mm i.d. and 70 mm length), further 
a higher flow rate and an extended run time was used. As for the analytical method the 
column was pre-equilibrated with 3.4 mM sodium acetate, 10% ethanol and 1% diethylene 
glycol, adjusted to pH 4.4 (buffer A). After loading the PEG-IFN samples, the column was 
washed with buffer A, followed by an ascending linear gradient to 10 mM dibasic 
potassium phosphate, 10% ethanol and 1% diethylene glycol, adjusted to pH 6.6 (buffer 
B). The flow rate was 1.0 mL/min and the detection at 218 nm. 



The protein concentration of the PEG-IFN alpha 2a isomer was determined by 
spectrophotometry, based on the 280 nm absorption of the protein moiety of the PEG- 
alpha 2a. 
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An analytical elution profile of 180 fig of PEG-IFN alpha 2a is shown in Fig. 1 . The 
result of this method is a separation into 8 peaks, 2 peaks with baseline separation and 6 
with partial separation. The decrease of the baseline absorption towards the end of the 
5 chromatogram suggests that there were no other monopegylated species of IFN alpha 2a 
eluting at higher retention time. 

In addition, looking carefully at the IEC-chromatogram a further peak close to the 
detection limit is visible between peaks 2 and 3 indicating the presence of additional 
10 positional isomers that should also contribute to the specific activity of the PEG-IFN alpha 
2a mixture. Additional species were expected as the interferon alpha-2a molecule exhibits 
12 sites for pegylation (11 lysines and the N-terminus). However, given the low abundance 
of the these species, they were not isolated and characterised. 

15 Isomer samples derived from IEC optimisation runs were investigated directly after 

the isolation (t — 0) and after 2 of weeks of storage at 5°C. No significant differences were 
observed for the protein derived from IEC-peaks with regard to the protein content as 
determined by spectrometric methods; nor were any changes to be detected in the 
monopegylation site, the content of oligo-PEG-IFN alpha 2a, the amount of aggregates 

20 and the bioassay activity. Taking into account the relative abundance of the individual 
isomers - as determined by the IEC method - as well as the specific activities - as 
determined in the anti-viral assay - almost the total specific bioactivity of the PEG-IFN 
alpha 2a mixture used for their isolation is recovered (approximately 93%). 

25 The analytical IE-HPLC was used to check the purity of the individual isomers with 

respect to contamination with other positional isomers in the IEC fractions. The peaks 2, 
3, 4, 4a, 5 and 7 had more than 98%, the peaks 1 and 8 had 93% and peak 6 had 88 % 
purity. 



30 
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Table 1: 

PEG-peptides identified by comparison of the Lys-C digest spectra of the isomers and the 
reference standard. 



Identified PEG Sites in the separated PEG-IFN Species 



Peak 


missing peaks 


in peptide map 


PEG-IFN 


PEG site 


Mr (DA) 


Sequence 


Peak 1 


K 31 


A, E 


24-49 


Peak 2 


K 134 


i, r 


134-164 


Peak 3 


K 131 


c 


122-131 a 


Peak 4 


K 12I 


B,C 


113-131 


Peak 4a 


K 164 


b 


134-164 a,b 


Peak 5 


K 70 


D,F 


50-83 


Peak 6 


K 83 


D, H 


71-112 


Peak 7 


K 49 


E,F 


32-70 


Peak 8 


K 112 


B, H 


84-121 



3 132-133 too small to detect. 
ab RP-HPLC. 

The fractions were characterised by the methods described herein. 



Example 4: Analysis of the fractions by mass spectrom etry peptide mapping 

Mass spectra were recorded on a MALDI-TOF MS instrument (PerSeptive 
Biosystems Voyager-DE STR with delayed extraction). Each IEC fraction (Ion Exchange 
Chromatography) was desalted by dialysis, reduced with 0.02 M 1 ,4-dithio-DL-threitol 
(DTT) and alkylated with 0.2 M 4-vinyl pyridine. Then the proteins were digested with 
endoproteinase Lys-C (Wako Biochemicals) in 0.25 M Tris (tris(hydroxymethyl)- 
aminoethane) at pH 8.5 with an approximate enzyme to protein ratio of 1:30. The reaction 
was carried out over night at 37 °C. 
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A solution of 20 mg/ml a-cyano-4-hydroxycinnamic acid and 12 mg/ml 
nitrocellulose in acetone/isopropanol 40/60 (v/v) was used as matrix (thick-layer 
application). First, 0.5 pL of matrix was placed on the target and allowed to dry. Then, 1.0 
pL of sample was added. The spectra were obtained in linear positive ionisation mode with 
5 an accelerating voltage of 20.000 V and a grid voltage of 95 %. At least 190 laser shots 

covering the complete spot were accumulated for each spectrum. Des-Arg ! -bradykinin and 
bovine insulin were used for internal calibration. 

Example 5: High-performance liquid chromatography ( RP-HPLO Peptide Mapping 

10 

The peptides were characterized by reverse-phase high-performance liquid 
chromatography (RP-HPLC) Peptide Mapping. The IEC fractions were reduced, alkylated 
and digested with endoproteinase Lys-C as described for the MALDI-TOF MS peptide 
mapping. The analysis of the digested isomers was carried out on a Waters Alliance HPLC 
15 system with a Vydac RP-C18 analytical column (5 pm, 2.1 x 250 mm) and a precolumn 
with the same packing material. Elution was performed with an acetonitrile gradient from 
1 % to 95 % for 105 min in water with a flow rate of 0.2 mL/min. Both solvents contained 
0.1 % (v/v) TFA. 100 pL of each digested sample were injected and monitored at 215 nm. 

20 Example 6: MALDI-TOF spectra of undigested protein 

An 18 mg/ml solution of trans-3-indoleacrylic acid in acetonitrile/0.1 % trifluoroacetic 
acid 70/30 (v/v) was premixed with the same volume of sample solution. Then 1.0 pL of 
the mixture was applied to the target surface. Typically 150 - 200 laser shots were averaged 
25 in linear positive ionisation mode. The accelerating voltage was set to 25.000 V and the grid 
voltage to 90 %. Bovine albumin M + and M 2+ were used for external calibration. 

Example 7: SE-HPLC (size exclusion HPLC) 

30 SE-HPLC was performed with a Waters Alliance 2690 HPLC system equipped with a 

TosoHaas TSK gel G 4000 SWXL column (7.8 x 300 mm). Proteins were eluted using a 
mobile phase containing 0.02 M NaH 2 P0 4 , 0.15 M NaCl, 1 % (v/v) diethylene glycol and 
10 % (v/v) ethanol (pH 6.8) at a flow rate of 0.4 mL/min and detected at 210 nm. The 
injection amounts were 20 pg of each isomers. 
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Size Exclusion HPLC and SDS-PAGE were used to determine the amount of oligo-PEG- 
IFN alpha 2a forms and aggregates in the different IEC fractions. The reference material 
contains 2.3 % aggregates and 2.2 % oligomers. 

Peaks 1, 4, 4a, 5, 6 and 8 contain < 0.7 % of the oligopegylated I FN alpha 2a forms, whereas 
5 in peaks 2, 3, and 7 the percentage of the oligopegylated I FN alpha 2a forms are under the 
detection limit (< 0.2 %). In the case of the aggregates a different trend could be seen. In all 
peaks the amount of aggregates is below 0.9 %. 

Example 8: SDS-PAGE 

10 . 

SDS-PAGE was carried out both under non-reducing and under reducing conditions 
using Tris-Glycine gels of 16 % (1.5 mm, 10 well). Novex Mark 12 molecular weight 
markers with a mass range from 2.5 to 200 kDa were used for calibration, bovine serum 
albumin (BSA) was used as sensitivity standard (2 ng). Approximately 1 jig of all the 
15 samples and 0.5 jig of standard were applied to the gel. The running conditions were 125 V 
and 6 W for 120 min. The proteins were fixed and stained using the silver staining kit 
SilverXpress from Novex. 

The gels that were recorded under non-reducing conditions for the IEC fractions 1 - 
20 8 show a pattern that is comparable to that of the PEG-IFN alpha 2a reference standard. 

Under reducing conditions, the gels show an increase in intensity of the minor bands at 
about 90 kDa as compared to the standard. Between 6 and 10 kDa protein fragments 
appear for peaks 6, 7 and 8. Both bands together correspond to approximately 1 % of 
clipped material. In the lanes of isomer 1, 5, 6, 7, 8 additional bands with more than 100 
25 kDa can be seen which are also present in the standard. These can be assigned to 
oligomers. Thus SDS-PAGE confirms the results of the SE-HPLC analysis. 

Overall, RP-HPLC and SDS-PAGE experiments indicate that the purity of the IEC 
fractions can be considered comparable to the PEG-IFN alpha 2a reference standard. 

30 The structure of the PEG-IFN alpha 2a species derived from the 9 IEC- fractions were 

identified based on the results of the methods described above using the strategy 
mentioned above. 
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Example 9: In Vitro Antiviral Protection Activity 



The antiviral activities of the isolated isomers were determined by the reduction of 
the cytopathic effect of vesicular stomatitis virus (VSV) on Madin-Darby bovine kidney 
5 (MDBK) cells. The cell lines were cultured in MEM-medium containing 10% fetal bovine 
serum. The isomers and PEG-IFN were diluted to a final concentration of 10 ng/mi (assay 
starting concentration) for the detection of the effective concentration. This effective 
concentration (EC 50 ) was determined as the concentration needed for 50% protection of 
the cells against the VSV. In all measurements, the cell culture was challenged with VSV 4 
10 hours after addition of the PEG-IFN isomers. After 18 hours growth at 37°C the cells were 
stained with crystal violet* to determine the number of intact cells. 

To determine the antiviral activity of the 9 positional isomers the EC 5 o-values were 
measured. PEG-IFN was tested in this assay as a reference. 

15 

Figure 2A shows the specific activities (U/|ig, grey columns) of the isomers . In this 
assay all positional isomers had activity, however at a different level. The isomers K31 with 
170% and K134 with 125% display higher values than PEG-IFN. K164 was equal to the 
mixture whereas the activities of K49, K70, K83, K112, K121 and K131 were lower. 



Example 10: Antiproliferative Activity Assay. 



For this assay, a primary cell culture of human melanoma metastases cells (ME15) 
was used. This cell line was deposited on August 17, 2000 under the terms of the Budapest 

25 Treaty at the Collection Nationale de Cultures de Microorganismes, Institut Pasteur, Paris 
under the registration number CNCM 1-2546. The cell line was cultured in RPMI medium 
supplemented with 10% FCS, 2 mM glutamine, ImM sodium pyruvate, 10 mM non- 
essential amino acids and HEPES buffer (all from Gibco Life Sciences, UK)* When 
confluent, the cells were passaged by trypsinization. Approximately 5000-6000 cells per 

30 well in a flat bottom 96 well plate were used as a start point for the cell proliferation assay 
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in the presence or in the absence of the individual concentrations of IFN, PEG-IFN and 
the isomers over a period of a 5 day period. The number of living cells was then 
determined using a cell staining kit (Promega, Madison, USA) based on the colorimetric 
detection of the cleavage of the tetrazolium salt MTS into formazan. The MTS reaction 
5 solution was added according to the manufacturer's recommendation for a period of 4 
hours at 37°C, after this period the absorption at 490 nm was recorded in a 
spectrophotometer. 

The specific antiproliferative activity of the isomers was determined by using an 
10 IFN- sensitive melanoma cell line (ME15) and a commercial assay. Positional isomers and 
PEG-IFN as references were used at a concentration of 100 U/ml (Figure 2B). The 
biological unit (U) for the isomers which was used for the experiments was defined and 
determined in the antiviral assay hereinbefore described, with PEG-IFN as standard 
reference. 

15 

In this assay all isomers were active with the isomer K31 at 159% and K134 at 115% 
relative to PEG-IFN. K164 displayed the same activity as PEG-IFN; all other isomers were 
less active than the reference. 



20 The results rank the activity essentially in the same order as in the antiviral assay 

indicating that same activities are derived from a common trigger or signalling event. 

One explanation for the different specific activities of the nine positional isomers in 
the two previous experiments could be their different protein stability. To find out this we 

25 tested some isomers at different concentrations in a time course experiment using the 
ME15 cell line. We selected isomers K31 and K134 which have the highest activity, and 
K121, which is the isomer with the lowest activity, together with IFN and PEG-IFN as "j 
references (see Figure 3). Different concentrations of IFN, PEG-IFN and positional 
isomers (500, 125 and 15 U/ml) were used for the treatment of the ME15 cells over a 

30 period of 5 days. Every 24h the number of cells was measured. 
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Again we found a striking correlation to the antiproliferative activity described above 
with K134 being most active followed by K31 and PEG-IFN. As expected the lowest 
activity was found for K121 and IFN. 

5 It can be observed that the inhibition of proliferation is decreased by diminishing 

concentration. Interestingly, PEG-IFN used at a concentration of 500 U/ml exhibits 
identical antiproliferative activity as compared to the isomer K134 used at 125 U/ml after 5 
days, while IFN and K121 exhibit lower or no antiproliferative activity after 5 days when 
used at a concentration of at least 125 U/ml. 

10 

In three independent biological assays we established a correlation between the 
intramolecular attachment site of the PEG moiety and the biological activity. In summary, 
the isomers K31 and K134 showed consistently higher specific activities than the 
commercial mixture, while the remaining seven isomers had a similar specific activity 
15 compared to the commercial mixture (K164). Activities range between 170% and 20% 
indicating that the difference in specific activities of the protein between the positional 
isomers is significant. 

i 
i 

Example 11: Expression and purification of the recombinant extracellular domain of the 
20 Interferon Receptor 2 from Rcoli : 

The extracellular domain of the Interferon Receptor 2 (IFNR2-EC) was cloned into a 
pET15b-vector (Invitrogen, US) with an N-terminal His-Tag®. This vector construct was 
used to express IFNR2-EC in Rcoli. The target protein precipitated in the cells in the form 

25 of inclusion bodies. The Exoli pellet after the fermentation in full medium was 
resuspended in lysis buffer containing 50 mM Tris, 10% saccharose, 0.02% sodium azide, 
10 mM magnesium chloride and EDTA-free protease inhibitor. After the homogenisation 
in a French Press and centrifiigation (13000 rpm) the inclusion bodies were solubilized in 
8 M urea. The first chromatography step was a nickel chelate chromatography with a linear 

30 gradient from 0-500 mM imidazol over 10 column volumes. The pooled peak of IFNR2 
was dialysed against 8 M urea to remove the imidazol. After this dialysis the sample was 
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diluted to 0.1 A 28 owith 8 M urea containing 150 mM Tris/HCL pH 8.0 and stored at room 
temperature until the Ellman assays were negative. The oxidised IFNR2-EC was dialysed 
against 50 mM Tris pH 8.4 for refolding. 50 mM Tris pH 8.4 was also the starting buffer 
for the second chromatography. The second chromatography step was with an anion ion- 
exchange resin (HiTrap Q HP, 5x5ml, Amersham Biosciences) with a gradient of 0-500 
mM NaCl. 



All purification steps were analysed by SDS-PAGE under reduced and non-reduced 
conditions. The protein purity was monitored by analytical size-exclusion 
chromatography using Superdex* 75 and 200 HR (PC 3.2/30, Amersham Biosciences), 
This chromatography method was also used to determine both the binding activity and 
capacity (data not shown). 

Exam ple 12: Characterisation of the interaction of TFN and the PEG-IFN isomers with the 
IFNR2-EC using an optical bio sensor system: 

The immobilisation of the IFNR2-EC was achieved by labelling with biotin. The 
biotinylated protein was anchored to a streptavidin sensor surface (SA Sensor, BIAcore). 

Biotinylation was performed by diluting the protein stock solution (10 mM acetate, 
pH 7.5) with Hepes (10 mM, pH = 8.4) to a final protein concentration of 0.23 mg/ml 
protein (8^65 uM). A 8.65 mM solution of biotinamidocaproate-n-hydroxysuccinimide 
ester (Fluka) in DMSO was added to the solution to give a final concentration of the 
biotinylating reagent of 86.5 uM. The mixture was allowed to react for lh before it was 
dialysed over night against a Hepes buffer (10 mM, pH 8.4, 150 mM NaCl) to remove low 
molecular weight reaction products. This solution was used in the immobilisation 
procedure. Immobilisation was achieved by contacting the solution of the biotinylated 
protein with the SA sensor surface. The amount of immobilised protein was controlled via 
the contact time. For the binding experiments 4000 RU, corresponding to approximately 4 
ng/mm 2 of protein were immobilised. 
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Binding experiments were performed in Hepes buffer (10 mM Hepes, 150 mM NaCl, 
3 mM EDTA, 0.005% polysorbate 20, pH 7.4) as the running buffer. Interferon solution 
was prepared by diluting stock solutions of the proteins with running buffer. The flow 
velocity was set to 5 |il/min for ail binding experiments. The interferon solutions were 
5 contacted for a time interval of 5 min with the immobilised IFNR2-EC. 



Calculation of binding data: dissociation constant (K D ) values were determined by 
fitting the time dependent response curves with a mathematical fit based on the simple 
kinetic modelof the formation of a 1/1 complex. ko n and koff values were obtained from this 
10 fit. Kd represents the ratio koff/kon 



Investigation of the interaction of proteins by label-free solid phase detection such as 
surface plasmon resonance (SPR) requires the immobilisation of one of the interacting 
compounds. 

15 

We used two different coupling strategies to immobilise IFN or IFNR2-EC to the 
sensor surface. The IFN was immobilized via free amino groups on a carboxylated dextran 
sensor surface, the IFNR2-EC was labeled with biotin and anchored to a streptavidin 
sensor surface. With these two sensors and the corresponding IFN respectively IFNR2-EC 
20 binding test were done. Consistent values of rate constants and binding affinities were 
obtained. The results indicate that the binding affinity is independent of the 
immobilization technique and the immobilized protein (data not shown). These 
preliminary tests were necessary to increase the accuracy for the measurements with the 
isomers and to verify the simple kinetic model of 1/1 complex formation. 

25 

For the determination of the affinity and kinetic constants we selected five isomers 
(K31, K121, K131,K134 and K164) to cover the whole activity range of the specific 
biological activity (see Figure 2 A,B). To increase the significance of the kinetic and 
equilibrium parameters, we performed binding studies on the immobilized IFNR2-EC 
30 with different concentrations of the isomers. The IFN and PEG-IFN were used as a 
reference. The result of the binding curve fits based on a kinetic model for a formation of a 
1/1 complex are given in Table 2. 
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kon 


k 0 ff 


KDkin 




[10 5 M-1xs-1] 


[10' 2 s-1] 


[riM] 


IFN 


12.5 


1.14 


9.12 


1 » — V_J II 1 M 


0.27 


2.3 


851 




n.d. 


n.d. 


n.d. (> 1mM) 


Kl34 


0.87 


3.96 


455 


K131 


0.96 


2.8 


289 


K121 


0.83 


2.33 


279 


Ki64 


2.19 


0.652 


335 



Table 2: Kinetic constants for the interaction of wild-type IFN; PEG-IFN and several 
positional isomers 

5 The table shows the measured affinity and kinetic data. The calculation for the 

kinetic constant is based on the kinetic model of a 1/1 complex formation. The IFNR2-EC 
was labelled with biotin. This biotinylated protein was anchored to a streptavidin sensor 
surface (SA Sensor, Biacor). Binding experiments were performed in Hepes buffer (10 mM 
Hepes, 150 mM NaCl, 3 mM EDTA, 0.005% polysorbate 20, pH 7.4) as the running buffer. 

10 

The obtained K D -values indicate that the pegylation of IFN causes a decrease in the 
affinity towards the INFR2-EC. The affinity deficit is dependent on the position of 
pegylation. The affinity change is generally due to a change in the association constants 
(kon). For instance, isomer K134 changes the affinity roughly by a factor of 15..ko n is 
15 reduced by a factor of more than 10, whereas k<>fr is increased only by a factor of 
approximately 4. 
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For the isomer K31 only an estimation of the K D was possible because the protein 
concentrations were too low. 

Combining the data from the specific antiviral or specific antiproliferative activity 
5 with the IFNR2-EC affinity study an interesting correlation was observed as seen in Figure 
4. It indicates that the higher the IFNR2-EC affinity (low K D ) of a given isomer, the lower 
the specific biological activity. This correlation is opposed to the common drug screening 
dogma which is principally used to search ligands which have a low K D and a high 
biological activity. 

10 

Example 13: Transcriptional response of the isomers compared to IFN 

DNA microarrays which permit simultaneous measurement of the expression levels 
of thousands of genes, provide a comprehensive frame-work to determine how the 
15 positional isomers affect cellular metabolism and regulation on a genomic scale. 

To estimate the influence of the pegylation site on the transcriptional activity of IFN 
we measured the IFN-induced gene expression pattern of the melanoma cell line MElS.via 
the oligonucleotide array technology 

20 

ME15 cells were incubated for 48h with 100 U/ml of IFN, PEG -I FN or PEG-IFN 
isoforms. After 48h the cultured cells were harvested by scraping and the total cellular 
RNA was extracted with RNA-Bee™. From each sample 10 |ig of the total cellular RNA 
were reverse transcribed (Invitrogen, U.S.), labelled (Ambion, U.S.) and processed by 
25 using commercial kits according to the supplier's instructions. The methods of the alkaline 
heat fragmentation and the following hybridisation of the cDNA with the U95Av2 
GeneChip arrays were standard procedure provided by the manufacturer of the 
microchips (Affymetrix, U.S.). A further example for a procedure for performing 
oligonucleotide array analysis can be found in Fambrough et al„ Diverse signaling 
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pathways activated by growth factor receptors induce broadly overlapping, rather than 
independent, sets of genes, Cell 97 (1999), 727-741. 



The raw data of the arrays were measured with a confocal laser scanner (Hewlett 
Packard, U.S.) and pixel levels were analysed using GeneChip v3.1 software (Affymetrix, 
U.S.). The expression level for each gene was calculated as normalised average difference of 
fluorescence intensity as compared to hybridisation to mismatched oligonucleotides, 
expressed in arbitrary units (AD). Each investigation of the samples was measured in 
triplicate with labelled cRNA from three different cell plates. 



The following four criteria were chosen for differential expression of a gene after 
treatments with IFN, PEG-1FN or the PEG-IFN isoforms compared to the untreated cells 
were: (1) The intensity value of the treated cells is at least two fold higher than the 
untreated cells. (2) The intensity value in the treated cells should be higher or at least 50 
AD. (3) The intensity value of the untreated cells should be lower than 50 AD. ( 4) The 
standard deviation has to be significantly smaller than the absolute change in average 
difference and the calculated confidence level of a gene is set greater than 97% (p value < 
0.03). 



For these experiments we used the same cell line (ME 15) as for the antiproliferative 
assay. We treated the cells for 48h with 100 U/ml of each isomer or with IFN and PEG- 
IFN. IFN was used as the reference. After this treatment total RNA was isolated from the 
ME- 15 cells. The methods for preparation of cRNA and subsequent steps leading to 
hybridization and scanning of the U95Av2 GeneChip Array are described above. Visual 
inspection of total RNA samples before processing by agarose gel electrophoresis did not 
reveal any signs of degradation or yield in the samples. The number of biological replicates 
for each isomer, IFN and PEG-IFN (n=3) allows the application of statistical algorithms 
for data analysis in order to account for experimental variation. The data analysis focused 
on genes which were lower than 50 AD in the untreated cells but at least 2-fold 
upregulated with an AD value of at least 50 in the treated cells. 
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The standard deviation had to be significantly smaller than the absolute change in 
average difference and the calculated confidence level of a gene was set to greater than 97% (p 
value < 0.03). 



5 By applying these criteria, an expression profile of 29 upregulated genes was 

observed. In this expression pattern 13 known IFN-inducible genes which are involved in 
the antiproliferative and antiviral pathway were highly upregulated as compared to IFN 
(Table 3). The hierarchical cluster (Figure 5B) illustrates that all isomers have 
transcriptional activity but generate different intensities. 

10 

The sum of AD (E AD) of the induced genes was determined for each isoform and 
IFN, and the transcriptional, antiviral and proliferative activities compared with IFN as a 
reference. Figure 5 A shows that isoforms with an increased biological activity, as 
determined by their antiviral and antiproliferative activities as compared to PEG-IFN, also 
15 have an increased transcriptional activity, while isoforms with a lower biological activity 
display a lower transcriptional activity, and that thus, in contrast to biological activity vs. 
binding assays, a positive correlation exists between biological and transcriptional activity. 



Beside the effect of the isomers on the general transcriptional gene response, a varied 
20 upregulation of individual genes was also observed. For example, the gene for interferon- 
induced nuclear phosphoprotein shows higher expression upon exposure to K49 and K31 
than with K134 or K131. K49 and K31 isomers have a lower general transcriptional gene 
response than K134 or K131. This observation emphasizes the necessity of analysing the 
general transcriptional gene response, as determined by the method of this invention, 
25 rather than to measure transcriptional expression levels of an individual gene, or a small 
selection of genes, for the determination of the transcriptional activity of a compound of 
interest. 



The data of the microarray chips were also analyzed for transcriptional-specific side- 
30 effects which may indicate undesirable effects on the metabolism or vitality of cells by any 
isomer. The criteria for these analyses were similar to those described before, the only 
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difference was that the data for each isomer were cross-compared. The result of this 
analysis showed no specific side-effect of any positional isomer. 
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Table 3: IFN modulated genes dependent on the pegylation side 

To identify induced genes the ME15 cell line were cultured 48 hours in absence 
(untreated cell) or in presence of 100 U/ml IFN or a PEG-IFN sample. The expression level 
for each gene was calculated as normalised average difference of fluorescence intensity as 
compared to hybridisation to mismatched oligonucleotide, expressed in arbitrary units. 
Each investigation of the samples were measured in triplicate with labelled cRNA from 
three different cell plates. The criteria for a differentially expression of a gene by treatments 
with IFN or a PEG-IFN sample compared to the untreated cells were: if the mean value 
increase was at least twofold higher as in the untreated cells and the main value in the 
treated cells should be higher or at least 50 units. The mean value of the untreated cells 
should be lower than 50 units. In addition, the standard deviation had to be significantly 
smaller than the absolute change in average difference and the calculated confidence level 
of a gene was set greater than 97% (p value < 0.03). By applying these criteria, 29 genes 
were found to be modulated. 
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Claims 

1. A method for determining the biological activity of a compound which can modulate 
gene transcription, comprising the steps of 

a) contacting a host with a compound; 

5 b) determining the general transcriptional gene response of the host; and 

c) quantitating the general transcriptional gene response induced by said compound. 

2. The method of claim 1, wherein the general transcriptional gene response is 
determined by a DNA array technology. 

10 

3. The method of claim 2, wherein the DNA array technology is an oligonucleotide array 
technology. 

4. The method of any one of claims 1 to 3, wherein the compound is a modified 
15 compound. 

5. The method of claim 4, wherein the modified compound is a modified protein. 

6. The method of claim 5, wherein the modified protein is a pegylated protein. 

20 

7. The method of claim 6, wherein the pegylated protein is a specific isolated isoform of a 
pegylated protein. 
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8. The method of claims 6 or 7> wherein the pegylated protein is Epo or INF. 
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9. The method of any one of claims 1 to 8 wherein the transcriptionally regulated genes 
are one or more genes selected from the group consisting of Seq ID No. 1 to 29. 

10. A method for determining the relative biological activity of a modified compound, 
comprising measuring the biological activity of the modified compound and the 
respective unmodified compound by the method of any one of claims 1 to 7, wherein 
the relative transcriptional activity of the modified protein is determined by calculating 
the ratio of the biological activity of the modified compound and the biological activity 
of the unmodified compound. 

11. The method of claim 8, wherein the modified compound is a chemically or 
enzymatically modifed compound. 

12. The method of claim 9, wherein the modified compound is a pegylated protein. 

13. A method for identifying compounds that indirectly modulate gene transcription 
comprising determining the biological activity of a first compound which is a known 
modulator of gene transcription by the method of any one of claims 1 to 9 in the 
presence or absence of a second compound, wherein a compound which indirectly 
modulates gene transcription is identified if the biological activity of the first 
compound measured in the presence of said second compound is significantly 
different from the biological activity of the first compound measured in the absence of 
said second compound. 

14. The method of claim 13 wherein the known modulator of gene transcription is Epo or 
IFN. 
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15. A compound identified by the methods of claims 1 to 14 which modulates gene 
transcription. 

16. Use of the method of claims 1 to 14 for identifying a compound which modulates gene 
5 transcription. 

17. A kit comprising components for carrying out the methods according to claims 1 to 
14. 

10 18. The kit of claim 17 wherein said components comprise a DNA array. 

19. A pharmaceutical composition comprising a compound of claim 15 and a 
pharmaceutical^ acceptable carrier. 

15 20. Use of a compound of claim 15 or a pharmaceutical^ acceptable salt thereof for the 

preparation of a medicament for the treatment of a disease, preferably a viral disease or 
of a cancer. 

21. The methods, compounds, kit and uses as hereinbefore described, especially with 
20 reference to the foregoing examples. 
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Abstract 2 1 Mar? 20fl3 

The present invention provides a novel method for determining the biological activity of a 
compound comprising the steps of a) contacting a host with a compound; b) determining 
the general transcriptional gene response of the host; and c) quantitating the general 
transcriptional gene response induced by said compound. 
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SEQUENCE LISTING ' * 2003 

<110> F. Hoffmann-La Roche AG 

<120> Transcriptional Activity Assay 

<130> 21574 

<160> 29 

<170> Patentln version 3.1 

<210> 1 

<211> 597 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> inter feron-alpha inducible gene; ISG12 gene 

<222> (1) . . (597) 

<223> GenBank accession No. X67325 



<400> 1 

agctgaagtt gaggatctct tactctctaa gccacggaat taacccgagc aggcatggag 60 

gcctctgctc tcacctcatc agcagtgacc agtgtggcca aagtggtcag ggtggcctct 12 0 
ggctctgccg tagttttgcc cctggccagg attgctacag ttgtgattgg aggagttgtg . 180 

gccatggcgg ctgtgcccat ggtgctcagt gccatgggct tcactgcggc gggaatcgcc 240 

tcgtcctcca tagcagccaa gatgatgtcc gcggcggcca ttgccaatgg gggtggagtt 3 00 

gcctcgggca gccttgtggg tactctgcag tcactgggag caactggact ctccggattg 360 

accaagttca tcctgggctc cattgggtct gccattgcgg ctgtcattgc gaggttctac 420 

tagctccctg cccctcgccc tgcagagaag agaaccatgc caggggagaa ggcacccagc 480 

catcctgacc cagcgaggag ccaactatcc caaatatacc tgggtgaaat ataccaaatt 540 

ctgcatctcc agaggaaaat aagaaataaa gatgaattgt tgcaactctt aaaaaaa 597 
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ctgcatctcc agaggaaaat aagaaataaa gatgaattgt 



tgcaactctt aaaaaaa 



597 



<210> 2 

5 <211> 996 

<212> DNA 

<213> Homo sapiens 
<220> 

<221> BST-2 

10 <222> (1) . . (996) 

<223> GenBank accession No. D28137 



<400> 2 

gtggaattca tggcatctac ttcgtatgac tattgcagag tgcccatgga agacggggat 6 0 

15 aagcgctgta agcttctgct ggggatagga attctggtgc tcctgatcat cgtgattctg 120 

ggggtgccct tgattatctt caccatcaag gccaacagcg aggcctgccg ggacggcctt 180 

cgggcagtga. tggagtgtcg caatgtcacc catctcctgc aacaagagct gaccgaggcc 240 

cagaagggct ttcaggatgt ggaggcccag gccgccacct gcaaccacac tgtgatggcc 30 0 

ctaatggctt ccctggatgc agagaaggcc caaggacaaa agaaagtgga ggagcttgag 360 

20 ggagagatca ctacattaaa ccataagctt caggacgcgt ctgcagaggt ggagcgactg 420 

agaagagaaa accaggtctt aagcgtgaga atcgcggaca agaagtacta ccccagctcc 480 

caggactcca gctccgctgc ggcgccccag ctgctgattg tgctgctggg cctcagcgct 540 

ctgctgcagt gagatcccag gaagctggca catcttggaa ggtccgtcct gctcggcttt 600 

tcgcttgaac attcccttga tctcatcagt tctgagcggg tcatggggca acacggttag 660 

25 cggggagagc acggggtagc cggagaaggg cctctggagc aggtctggag gggccatggg 7 20 

gcagtcctgg gtgtggggac acagtcgggt tgacccaggg ctgtctccct ccagagcctc 7 80 
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cctccggaca atgagtcccc cctcttgtct cccaccctga gattgggcat ggggtgcggt 840 

gtggggggca tgtgctgcct gttgttatgg gttttttttg cggggggggt tgcttttttc 900 

tggggtcttt gagctccaaa aaataaacac ttcctttgag ggagagcaaa aaaaaaaaaa 960 

aaaaaaaaaa aaaaaaaaaa aaagaattcc accaca 996 

5 



<210> 3 
<211> 6803 
<212> DNA 
10 <213> Homo sapiens 
<220> 

<221> interferon-inducible peptide (6-16) gene 
<222> (1) . . (6803) 

<223> GenBank accession No. U22970 

15 

<400> 3 

agatctatca tgatggccac atgaacacag gcttcactgg tcttaccata tgcccatgac 6 0 

ccagaagcag ccagcctgag agaacaatgg aacaaagtga cttaatttcc aaagttctgg 120 
gggtttataa acagcatatg gtatgtcaca tgactactgt gggattggaa aaagatcatg 180 

20 taaatagagc actttgcaca gaccctggca tgcagcaagt gctcaataaa tgataggtgt 240 
tgtttactaa ttggactgaa tggtgaaagg cctgtgtgcc ccagggggag ctggtgatca 300 
ggcttcacta agcccagtat ggccgtggct ctcatctcag tgtgacatgc tttgaatacc 360 
cttagcggct ccaaaagtcc tcagcttgaa gtgcattttt ctgccagcag gcagcacaca 420 
aatgttccgc tgggcggagc tgggagagag gggaaaatga aactctgcag agtgcaggag 480 

25 ctgggagaga ggggaaaatg aaactgcaga gtgcagaaat agaaactccg acagggattg 540 
gctgcctagg gtgagacgtg ggaggatcca caagtgatga taaaaagcca gccttcagcc 600 
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ggagaaccgt 


ttactcgctg 


ctgtgcccat 


ctatcagcag 


gctccgggct 


gaagattgct 


6 60 




tctcttctct 


cctccaaggt 


aaactcagga 


gcttatgaag 


tgtgggcatt 


caagctgcca 


"~i o n 

720 




ccctctgcca 


ggctgcctgt 


ctgcctgtaa 


atctcatgtt 


ctgagagcca 


ggaggcccct 


Ton 




tctcctggga 


ggcagcactc 


ctgggtccct 


tttagtgctc 


tgggctggga 


cttgtctaag 


840 


5 


aggatgggtt 


ggagattttt 


agggagatgg 


gatgcaaaac 


cccaagtggc 


atgagaccca 


900 




gcttacaggt 


gcaatatcag 


cgatctgtgg 


ccttaacact 


gtcacctctt 


ggagccttaa 


960 




ttacttcctc 


tgtaaaagga 


aagttaagtt 


gcctttgctg 


ctcaaaggac 


tggagtattt 


1020 




tagaatcccc 


tttattatag 


gatccaatgt 


gacagtgagt 


tcatctttaa 


ggatagatga 


1080 




agccatctgg 


agtggttgtt 


aaaaatgtgg 


gcttggtggc 


caggcgtggt 


ggttcacgcc 


1140 


10 


tgtaatccca 


gcactttggg 


aggccgagac 


ccacggatca 


cgaggtcagg 


agatcgagac 


1200 




catcctggct 


aacacagtga 


aaccctgtct 


ctactaaaaa 


tacaaaaaaa 


attagccggg 


1260 




cttggtggtg 


ggcgcctgta 


gtctcagcta 


ctcgggaggc 


tgaggcaggc 


aggagaatgg 


1320 




catgaacctg 


ggaggtggag 


cttgcagtga 


gatgagattg 


cgccactgca 


ctccagcctg 


1380 




ggcgacagag 


aaagactcca 


tctcaaaaaa 


aaaaaaaaaa 


aaaatgtggg 


cttgggtatt 


1440 


15 


agccaattgt 


gggtttaaac 


cctgactttg 


ccatgacctg 


gctaggcagc 


cttgggaata 


1500 




ttacttaacc 


tctctgagct 


tcagtttcct 


tatctataac 


atgtggttga 


taacaatggt 


1560 




acctacttcg 


tggtgttgtt 


ataggtacaa 


gtggactgcg 


tgccataaag 


tgctaagtac 


1620 




tcaatacatg 


agaggagttt 


cttgtttttc 


tttttaactt 


aagaaaatag 


cattgatctg 


1680 




acaaagatca 


tattagaggt gagtctgagg gcatatgcat tttccagttt tgtggtataa 


1740 


20 


aggaaagagc 


ataggttcat 


gaattctgac 


tgtcttgaat 


gtgtacccat 


gttgcacaat 


1800 




ttgctgtgtg 


acctttatcc 


cttgtgatat 


tgtaaaatat 


attttttcat 


cccccagttc 


1860 




ctggcataca 


actcctaaaa 


tccttggaat 


cttcaaagtg 


ataaatatct 


tttgagtgat 


1920 




aactgtctaa 


taagttgact 


gatggctggc 


atttcctctg 


gggaggggag 


aggggctgaa 


1980 




ggttaagctg 


attaccaatg 


ggcagtgatt 


taatcagtca 


tgcctatgta 


atgaagcctc 


2040 


25 


cataaaaaca 


caaaaggaca 


gggtttgaca 


agcttccaga 


tagccgaata 


caggaggtct 


2100 




ccagtgggtg 


gtgcttctgg 


agaggaatgg 


aaactccatg 


ccctttctcc 


cataccttgc 


2160 
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cctatgcatc tcttcatctg gagcatttgt aatatccata aggaaccagt aaatgtagct 2220 

gggtatggtg gctgatgcct gtaatctcag cactttggga ggctgaggca ggcagattgc 2280 

gtgagttcag gagttcaaga tcagcctggg caacatggtg aaaccccatc tcaaaaaaaa 2340 

aaaaaaaaat tagttgggcg tggtggtgca tacctgtagt cccagctatc tggaggctga 2400 

5 ggtgggagga tcacttgagc ctgggaggcg aaggtctcag tgagctgaga tggggccact 2460 

gcactccagc ctgagcaata gagtgagacc ctgtctcaaa acaacaacaa gatcaatttt 2520 

ttttttttaa agaaagaaac cagtatatgt aagtaaagtg tttccctgag ttctgtgagc 2580 

tgctctagca aattagtaaa actgcaggag ggaattgtgg gaaccccagt ttatagtcag 2640 

tcagaaacac aggtaaaaca acctagggct tcgattggca gtctggggga ctgaaccctc 2700 

10 aacatgtggg atcggccact atctacccat agtggtgtca gaattgaatt tggggaccct 2760 

cagctggtgt tcaccagagg actgattgtt tgcttgctgg tggggagaaa cccccacact 282 0 

tctgctgtca gaagcctctc gtacggtgag ggaaactgag tttctttttc cacacaccct 2880 

tctgtgcttt ctttctgaaa aaaaaattaa gttgatgtga ggccaacgtt tggtgccagt 2940 

agatattcaa tgcattgtaa gaatgagttt gtcttaaaaa tttttttaaa gcaaattcta 3000 

15 tgcgtcaggc actgttcaaa gtgcgggata cagcgtggaa cagttattaa attaacaagt 3060 

accctctgaa atagactcta ttaatacccc aattttagaa ataaacaaac tgaagcccag 3120 

agaagcaaag taccctagta tcctgcccaa ggtcacttgg gtacaaggcc aagaacaacc 3180 

acagtcatga tattgacatc tgcggacgtt accacgtgcc aaacaccgtg cgaatccgca 3240 

cagaatccgc acacccaccg gttggggcag ggacatctct gccactgatg cggagaggaa 3300 

20 acggattccc gaggctcctg agagggatcc tgggttccat tcaccacacc tcgctgcttt 3360 

ttactgattc agttggaact ggaagggaaa acaacggagc tgatgagctc caaaaatgct 3420 

ccggggacca tctccctctc gcccgttcgc aggtctagtg acggagcccg cgcgcggcgc 3480 

caccatgcgg cagaaggcgg tatccgtttt cttgtgctac ctgctgctct tcacttgcag 3540 

tggggtggag gcaggtgaga atgcgggtaa ggatgcaggt aaggggacag gtaagggtgc 3600 

25 aggtaaggat gcaagtaagg gtgcaggtaa ggatgcgggt aaggatgcag gtaaagatgc 3 6 60 

gggtaaggat gcaggtaagg atgcgggtaa ggatgcaggt aagggtgcag gtaaggatgc 3720 



gggtaaggat gcaggtaagg atgcaggtaa 
gggtaaggat gcaggtaagg gtgcaggtaa 
aagtaaggag gcatataagg gtgctggggg 
ttcagggtcc ggggtgtaga gggtctggcg 
5 aatggacagc gcagggttgg tgtcgccaga 
attcctccat cctaccgaaa tttactcgat 
gaaagtccct tcccctctgt gaggctctct 
gagtggtcct gagcatcaga cacagagtag 
actacgcttt gtctgctctc ctgcaggtaa 

10 ctccgggttc tggaaggccc tgaccttcat 
aggatctcgg gcaggcgggg cgggcctctg 
ctccaggact cgcagtcgcc gggctgcccg 
ccaactcggt ggctgcctcg ctgatgagct 
ccgccggggg gctagtggcc acgctgcaga 

15 gggacgttct ttactattat tttcatcata 
aagtaattta ctcaagtaat taagtagttt 
attttgagac agagtctggc tgtgtcaccc 
ggctcactgc aacctctgcc tccctggctc 
agctgggatt acaggcacac gccaccacgc 

20 ggggtttcac catgttggcc aggctggctc 
tcggcgtccc atagtgctag gattacaggc 
taacttactc aagtctccct gctagtttag 
caatgcgaga agccccattc tcagcctcaa 
aatcacaagc ctcagttccc tcagcggtgc 

25 tttgaatctg ctcaatctca gagaccacag 
tcaatctcac accaaactgg tcttgacttg 



-39- 

gggtgcaggt aaggatgcag gtaaggatgt 3780 

gggaacagat aagggtgcag gtaaggggac 3840 

atgggaggag. tgtgggataa aggagagagt 3900 

tgcttctgta gagaccctgg cgtccattcc 3960 

gtcccagatt caaatcccac tactctgttc 4020 

gctagttctg tgccagctcg gcgcctaggg 4080 

gagtaggaga gtggtggccc ccgcctccag 4140 

gggacccctg tgtcccaccc caacaacccc 42 00 

gaaaaagtgc tcggagagct cggacagcgg 4260 

ggccgtcgga ggaggtgggt ctggagggcg 4320 

ccgcggacgc tccctcacct gctcctgttc 4380 

cgctgggctt caccggcgcc ggcatcgcgg 4440 

ggtctgcgat cctgaatggg ggcggcgtgc 4500 

gcctcggtga gtgcggggcc tgggcctggt 4560 

catctgggga aactgaggca ctcaggaatt 4620 

accaggaatt aagtgattct ttatttattt 4680 

aggcaggctg aagtgcagtg gtgtgatctt 4740 

aagcgatttt cctgccacag cctcccaagt 4800 

ccggctaatt tttgtatttt tag.tagagat .4860 

aaactcctga cctcagctga tccgcccgcc 4920 

gtgagtcacc gcgtccggcc agaaattaag 4980 

accagagcct agattctgac ccagtcagta 5040 

aggctgggtg cctcactgat gctttggcca 5100 

gtggggatga tcatcctcag tttaacattt 5160 

tagcaaagaa acatagtaga gtaactcaaa 5220 

cactttttga acaccatgga gggaacagat 5280 



-40- 

aacagagctc acagtgactc ttctatcgat tgatcaatct attgatctat ttattataga 5340 

taaaggtcaa aacaaggtct cacactgcca ggttggaatg cagtgcatga tcacagctca 5400 

cagcagcctc gaactcccag gttcaagcta tcttcctgcc tcagattccc aagtagctgg 5460 

gactataggc gcataccgcc acacactggc tttttttttt tttttttttt ttttttttgt 5520 

5 agagacgggg tctcattgtg ttgcccagga tggtcttgaa ttcctgggct caagcaatcc 5580 

tcccactttg gcctcccaaa gtgttgagac tgcaagcatg agccactgtg ctggcccaga 5640 

gtgactcata aaaaatggcc ttacttccct ctctctcctc tctccccacc cacccccacc 5700 

tctctctcgc gctctgaggc ctccaaaatc ctggagaaaa cctgcccctg acaaacttcc 5760 

ctctctgcct ttctgaacct cgcatctcct cttctctcaa ttctgaatgg caaaagccca 5820 

10 aagaaccagc ccaaaagaag agagccctgc tcagacgggg ccacacccct gcaatgggag 5880 

gggaagagtg tgggcgagcc cagggacacc tctgggctat cagctagttg tcctctcaca . 5940 

tggcacatag ctgtgacaag atgaaatgat ccctggcttg tgaacccccc acacccacac 6000 

agggctccat gaaggtggct gttagcatga ttaataacat ggcgtttcca tggcagacat 6060 

caggtgaagg agccattatt tttgcctttt tcatcactga aacacatctc tgatttggga 6120 

15 tgtcaaggca aggtaaaaat gggactcact ttgtagtagt gccatcccct ctcagggcct - 6180 

ctgtttctcc atctagacat aaggggttgg actcaaccaa tggtttccag acatttggat 6240 

tcccaggcct ggcacttggg cccagtgact catctgtgtt tgctttacag gggctggtgg 6300 

cagcagcgtc gtcataggta atattggtgc cctgatgcgg tacgccaccc acaagtatct 6360 

cgatagtgag gaggatgagg agtagccagc agctcccaga acctcttctt ccttcttggc 642 0 

20 ctaactcttc cagttaggat ctagaacttt gccttttttt tttttttttt ttttttttga 6480 

gatgggttct cactatattg tccaggctag agtgcagtgg ctattcacag atgcgaacat 6540 

agtacactgc agcctccaac tcctagcctc aagtgatcct cctgtctcaa cctcccaagt 6600 

aggattacaa gcatgcgccg acgatgccca gaatccagaa ctttgtctat cactctcccc 6660 

aacaacctag atgtgaaaac agaataaact tcacccagaa aacactttgt cctgctgtca 6720 

25 atcatgtttg cagtgagaag cccaaaacaa tctggctctg gcctgcacca tccacacacc 6780 

cccattccca ttttcccctc gag 6803 



-41 - 



<210> 4 

<211> 634 

5 <212> DNA 

<213> Homo sapiens 
<220> 

<221> interferon- induced 17-kDa/15-kDa protein 

<222> (1)..(634) 

10 <223> GenBank accession No. M13755 

<400> 4 

cggctgagag gcagcgaact catctttgcc agtacaggag cttgtgccgt ggcccacagc 60 

ccacagccca cagccatggg ctgggacctg acggtgaaga tgctggcggg caacgaattc 120 

15 caggtgtccc tgagcagctc catgtcggtg tcagagctga aggcgcagat cacccagaag 180 

attggcgtgc acgccttcca gcagcgtctg gctgtccacc cgagcggtgt ggcgctgcag 240 

gacagggtcc cccttgccag ccagggcctg ggccctggca gcacggtcct gctggtggtg 300 

gacaaatgcg acgaacctct gagcatcctg gtgaggaata acaagggccg cagcagcacc 360 

tacgaggtcc ggctgacgca gaccgtggcc cacctgaagc agcaagtgag cgggctggag 420 

20 ggtgtgcagg acgacctgtt ctggctgacc ttcgagggga agcccctgga ggaccagctc 



ggaggcggca cagagcctgg cgggcggagc taagggcctc caccagcatc cgagcaggat 
caagggccgg aaataaaggc tgttgtaaga gaat 



480 



ccgctggggg agtacggcct caagcccctg agcaccgtgt tcatgaatct gcgcctgcgg 540 



600 
634 



25 



<210> 5 



-42- 

<211> 2058 

<212> DNA 

<213> Homo sapiens 

<220> 

5 <221> mRNA expressed in osteoblast 

<222> (1)..(2058) 

<223> GenBank accession No. AB000115 



<400> 5 

10 gcacgaggaa gccacagatc tcttaagaac tttctgtctc caaaccgtgg ctgctcgata 60 

aatcagacag aacagttaat cctcaattta agcctgatct aacccctaga aacagatata 120 

gaacaatgga agtgacaaca agattgacat ggaatgatga aaatcatctg cgcaactgct 180 

tggaaatgtt tctttgagtc ttctctataa gtctagtgtt catggaggta gcattgaaga 240 

tatggttgaa agatgcagcc gtcagggatg tactataaca atggcttaca ttgattacaa 3 00 

15 tatgattgta gcctttatgc ttggaaatta tattaattta cgtgaaagtt ctacagagcc 360 

aaatgattcc ctatggtttt cacttcaaaa gaaaaatgac accactgaaa tagaaacttt 420 

actcttaaat acagcaccaa aaattattga tgagcaactg gtgtgtcgtt tatcgaaaac 480 

ggatattttc attatatgtc gagataataa aatttatcta gataaaatga taacaagaaa 540 

cttgaaacta aggttttatg gccaccgtca gtatttggaa tgtgaagttt ttcgagttga 600 

20 aggaattaag gataacctag acgacataaa gaggataatt aaagccagag agcacagaaa 66 0 

taggcttcta gcagacatca gagactatag gccctatgca gacttggttt cagaaattcg 720 

tattcttttg gtgggtccag ttgggtctgg aaagtccagt tttttcaatt cagtcaagtc 780 

tatttttcat ggccatgtga ctggccaagc cgtagtgggg tctgatacca ccagcataac 840 

cgagcggtat aggatatatt ctgttaaaga tggaaaaaat ggaaaatctc tgccatttat 900 

25 gttgtgtgac actatggggc tagatggggc agaaggagca ggactgtgca tggatgacat 960 

tccccacatc ttaaaaggtt gtatgccaga cagatatcag tttaattccc gtaaaccaat 1020 



-43- 

tacacctgag cattctactt ttatcacctc tccatctctg aaggacagga ttcactgtgt 1080 

ggcttatgtc ttagacatca actctattga caatctctac tctaaaatgt tggcaaaagt 1140 

gaagcaagtt cacaaagaag tattaaactg tggtatagca tatgtggcct tgcttactaa 1200 

agtggatgat tgcagtgagg ttcttcaaga caacttttta aacatgagta gatctatgac 1260 

5 ttctcaaagc cgggtcatga atgtccataa aatgctaggc attcctattt ccaatatttt 1320 

gatggttgga aattatgctt cagatttgga actggacccc atgaaggata ttctcatcct 1380 

ctctgcactg aggcagatgc tgcgggctgc agatgatttt ttagaagatt tgcctcttga 1440 

ggaaactggt gcaattgaga gagcgttaca gccctgcatt tgagataagt tgccttgatt 1500 

ctgacatttg gcccagcctg tactggtgtg ccgcaatgag agtcaatctc tattgacagc 1560 

10 ctgcttcaga ttttgctttt gttcgttttg ccttctgtcc ttggaacagt catatctcaa 1620 

gttcaaaggc caaaacctga gaagcggtgg gctaagatag gtcctactgc aaaccacccc 1680 

tccatatttc cgtaccattt acaattcagt ttctgtgaca tctttttaaa ccactggagg 1740 

aaaaatgaga tattctctaa tttattcttc tataacactc tatatagagc tatgtgagta 1800 

ctaatcacat tgaataatag ttataaaatt attgtataga catctgcttc ttaaacagat 1860 

15 tgtgagttct ttgagaaaca gcgtggattt tacttatctg tgtattcaca gagcttagca 1920 

cagtgcctgg taatgagcaa gcatacttgc cattactttt ccttcccact ctctccaaca 1980 

tcacattcac tttaaatttt tctgtatata gaaaggaaaa ctagcctggg caacatgatg 2040 

aaaccccatc tccactgc 2058 

20 

<210> 6 
<211> 2607 
<212> DNA 
<213> Homo sapiens 
25 <220> . 

<221> transcription factor ISGF-3 



<222> 
<223> 



-44- 

(1) . . (2607) 

GenBank accession No. M97936 



<400> 6 

5 attaaacctc tcgccgagcc cctccgcaga 

gccatcctcg agagctgtct aggttaacgt 

ttggcaccta acgtgctgtg cgtagctgct 

ggcacaaggt ggcaggatgt ctcagtggta 

ggagcaggtt caccagcttt atgatgacag 

10 acagtggtta gaaaagcaag actgggagca 

ccgttttcat gacctcctgt cacagctgga 

taacttcttg ctacagcata acataaggaa 

ggaagaccca atccagatgt ctatgatcat 

tctggaaaac gcccagagat ttaatcaggc 

15 gttagacaaa cagaaagagc ttgacagtaa 

tatagagcat gaaatcaaga gcctggaaga 

aaccttgcag aacagagaac acgagaccaa 

acagctgtta ctcaagaaga tgtatttaat 

caaaataata gagttgctga atgtcactga 

20 actagtggag tggaagcgga gacagcagag 

cttggatcag ctgcagaact ggttcactat 

gcagcttaaa aagttggagg aattggaaca 

aaaaaacaaa caagtgttat gggaccgcac 

ctcgtttgtg gtggaaagac agccctgcat 

25 gaagacaggg gtccagttca ctgtgaagtt 

ttataatttg aaagtcaaag tcttatttga 



ctctgcgccg gaaagtttca tttgctgtat 60 

tcgcactctg tgtatataac ctcgacagtc 120 

cctttggttg aatccccagg cccttgttgg 180 

cgaacttcag cagcttgact caaaattcct 240 

ttttcccatg gaaatcagac agtacctggc 300 

cgctgccaat gatgtttcat ttgccaccat 360 

tgatcaatat agtcgctttt ctttggagaa 420 

aagcaagcgt aatcttcagg ataattttca 480 

ttacagctgt ctgaaggaag aaaggaaaat 540 

tcagtcgggg aatattcaga gcacagtgat 600 

agtcagaaat gtgaaggaca aggttatgtg 660 

tttacaagat gaatatgact tcaaatgcaa 720 

tggtgtggca aagagtgatc agaaacaaga 780 

gcttgacaat aagagaaagg aagtagttca 840 

acttacccag aatgccctga ttaatgatga 900 

cgcctgtatt ggggggccgc ccaatgcttg 960 

agttgcggag agtctgcagc aagttcggca 1020 

gaaatacacc tacgaacatg accctatcac 1080 

cttcagtctt ttccagcagc tcattcagag 1140 

gccaacgcac cctcagaggc cgctggtctt 12 00 

gagactgttg gtgaaattgc aagagctgaa 1260 

taaagatgtg aatgagagaa atacagtaaa 13 2 0 



aggatttagg aagttcaaca ttttgggcac 
caccaatggc agtctggcgg ctgaatttcg 
tggcaccaga acgaatgagg gtcctctcat 
tgaaacccaa ttgtgccagc ctggtttggt 

5 tgtggtgatc tccaacgtca gccagctccc 
catgctggtg gcggaaccca ggaatctgtc 
ggctcagctt tcagaagtgc tgagttggca 
tgtggaccag ctgaacatgt tgggagagaa 
tctcattccg tggacgaggt tttgtaagga 

10 gctttggatt gaaagcatcc tagaactcat 
tgggtgcatc atgggcttca tcagcaagga 
gccggggacc ttcctgctgc ggttcagtga 
atgggtggag cggtcccaga acggaggcga 
gaagaaagaa ctttctgctg ttactttccc 

15 tgctgagaat attcctgaga atcccctgaa 
tgcctttgga aagtattact ccaggccaaa 
ccctaaagga actggatata tcaagactga 
cacagaagag tgacatgttt acaaacctca 
gaagatgctt gtattttact tttccattgt 

20 tgagatcccc gtgttactgc ctatcagcat 
aaaaaccaaa tttgtattta aggtatataa 
gtataaataa aatgagcaaa agttgaa 
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gcacacaaaa gtgatgaaca tggaggagtc 1380 

gcacctgcaa ttgaaagaac agaaaaatgc 1440 

cgttactgaa gagcttcact cccttagttt 1500 

aattgacctc gagacgacct ctctgcccgt 1560 

gagcggttgg gcctccatcc tttggtacaa 1620 

cttcttcctg actccaccat gtgcacgatg 1680 

gttttcttct gtcaccaaaa gaggtctcaa 1740 

gcttcttggt cctaacgcca gccccgatgg 1800 

aaatataaat gataaaaatt ttcccttctg 1860 

taaaaaacac ctgctccctc tctggaatga 1920 

gcgagagcgt gccctgttga aggaccagca 1980 

gagctcccgg gaaggggcca tcacattcac 2040 

acctgacttc catgcggttg aaccctacac 2100 

tgacatcatt cgcaattaca aagtcatggc 2160 

gtatctgtat ccaaatattg acaaagacca 2220 

ggaagcacca gagccaatgg aacttgatgg 2280 

gttgatttct gtgtctgaag tgtaagtgaa 2340 

agccagcctt gctcctggct ggggcctgtt 2400 

aattgctatc gccatcacag ctgaacttgt 2460 

tttactactt taaaaaaaaa aaaaaaagcc 2520 

attttcccaa aactgatacc ctttgaaaaa 2580 

2607 



25 <210> 7 

<211> 1584 



-46- 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> ISGF3-gamma 

5 <222> (1) . . (1584) 

<223> GenBank accession No. M87503 



<400> 7 

gatcagaggg cgatcagctg gacagcaact caggatggca tcaggcaggg cacgctgcac 60 

10 ccgaaaactc cggaactggg tggtggagca agtggagagt gggcagtttc ccggagtgtg 120 

ctgggatgat acagctaaga ccatgttccg gattccctgg aaacatgcag gcaagcagga 180 

cttccgggag gaccaggatg ctgccttctt caaggcctgg gcaatattta agggaaagta 240 

taaggagggg gacacaggag gtccagctgt ctggaagact cgcctgcgct gtgcactcaa 300 

caagagttct gaatttaagg aggttcctga gaggggccgc atggatgttg ctgagcccta 360 

15 caaggtgtat cagttgctgc caccaggaat cgtctctggc cagccaggga ctcagaaagt 420 

accatcaaag cgacagcaca gttctgtgtc ctctgagagg aaggaggaag aggatgccat 480 

gcagaactgc acactcagtc cctctgtgct ccaggactcc ctcaataatg aggaggaggg 540 

ggccagtggg ggagcagtcc attcagacat tgggagcagc agcagcagca gcagccctga 600 

gccacaggaa gttacagaca caactgaggc cccctttcaa ggggatcaga ggtccctgga 660 

20 gtttctgctt cctccagagc cagactactc actgctgctc accttcatct acaacgggcg 720 

cgtggtgggc gaggcccagg tgcaaagcct ggattgccgc cttgtggctg agccctcagg 780 

ctctgagagc agcatggagc aggtgctgtt ccccaagcct ggcccactgg agcccacgca 840 

gcgcctgctg agccagcttg agaggggcat cctagtggcc agcaaccccc gaggcctctt 900 

cgtgcagcgc ctttgcccca tccccatctc ctggaatgca ccccaggctc cacctgggcc 960 

25 aggcccgcat ctgctgccca gcaacgagtg cgtggagctc ttcagaaccg cctacttctg 1020 

cagagacttg gtcaggtact ttcagggcct gggcccccca ccgaagttcc aggtaacact 1080 
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gaatttctgg gaagagagcc atggctccag ccatactcca cagaatctta tcacagtgaa 1140 
gatggagcag gcctttgccc gatacttgct ggagcagact ccagagcagc aggcagccat 1200 
tctgtccctg gtgtagagcc tgggggaccc atcttccacc tcacctcttt gttcttcctg 1260 
tctcctttga agtagactca ttcttcacac gattgacctg tcctctttgt gataattctc 1320 
5 agtagttgtc cgtgataatc gtgtcctgaa aatcctcgca cacactggct ggtggagaac 1380 
tcaaggctaa ttttttatcc tttttttttt tttatttttg agatatacgc ccfcctttcat 1440 
ctgtaaggga ctaggaaatt ccaaatggtg tgaacccagg gggcctttcc ctcttccctg 1500 
acctcccaac tctaaagcca agcactttat atttttctct tagatattca ctaaggactt 1560 
aaaataaaat ttttttgaaa gagg 1584 

10 

<210> 8 
<211> 483 
<212> DNA 
15 <213> Homo sapiens 
<220> 

<221> hepatitis C-associated micro tubular aggregate 
<222> (1) . . (483) 

<223> GenBank accession No. D28915 
20 <400> 8 

cctctggttg cctttcctga gataatccac taagaatatt ttgtgtttct tttctcaggg 60 
aatctaaggg aggaaattat caactgtgca caaggaaaaa aatagatatg tgaaaggttc 120 
acgtaaattt cctcacatca cagaagatta aaattcagaa aggagaaaac acagaccaaa 180 
gagaagtatc taagaccaaa gggatgtgtt ttattaatgt ctaggatgaa gaaatgcata 240 
25 gaacattgta gtacttgtaa ataactagaa ataacatgat ttagtcataa ttgtgaaaaa 300 
taataataat ttttcttgga tttatgttct gtatctgtga aaaaataaat ttcttataaa 360 



-48- 

actcgggtct aacttgagag tgtgtgtgat tttggaaaaa ttatgatttg tcagcatctt 420 

ctgatattca ctgctttcat cttaattttg ccttctgatt ttatttctaa agtatgtgat 480 

ttt 483 



<210> 9 

<211> 1642 

<212> DNA 

<213> Homo sapiens 
10 <220> 

<221> interferon-inducible 56 Kd protein_ ( IFIT-1 ) 

<222> (1) . . (1642) 

<223> GenBank accession No. M24594 



15 <400> 9 

ccagatctca gaggagcctg gctaagcaaa accctgcaga acggctgcct aatttacagc 60 

aaccatgagt acaaatggtg atgatcatca ggtcaaggat agtctggagc aattgagatg 120 

tcactttaca tgggagttat ccattgatga cgatgaaatg cctgatttag aaaacagagt 180 

cttggatcag attgaattcc tagacaccaa atacagtgtg ggaatacaca acctactagc 24 0 

20 ctatgtgaaa cacctgaaag gccagaatga ggaagccctg aagagcttaa aagaagctga 300 

aaacttaatg caggaagaac atgacaacca agcaaatgtg aggagtctgg tgacctgggg 360 

caactttgcc tggatgtatt accacatggg cagactggca gaagcccaga cttacctgga 420 

caaggtggag aacatttgca agaagctttc aaatcccttc cgctatagaa tggagtgtcc 480 

agaaatagac tgtgaggaag gatgggcctt gctgaagtgt ggaggaaaga attatgaacg 540 

25 ggccaaggcc tgctttgaaa aggtgcttga agtggaccct gaaaaccctg aatccagcgc 600 

tgggtatgcg atctctgcct atcgcctgga tggctttaaa ttagccacaa aaaatcacaa 660 



-49- 

gccattttct ttgcttcccc taaggcaggc tgtccgctta aatccagaca atggatatat 720 

taaggttctc cttgccctga agcttcagga tgaaggacag gaagctgaag gagaaaagta 780 

cattgaagaa gctctagcca acatgtcctc acagacctat gtctttcgat atgcagccaa 840 

gttttaccga agaaaaggct ctgtggataa agctcttgag ttattaaaaa aggccttgca 900 

5 ggaaacaccc acttctgtct tactgcatca ccagataggg ctttgctaca aggcacaaat 960 

gatccaaatc aaggaggcta caaaagggca gcctagaggg cagaacagag aaaagctaga 1020 

caaaatgata agatcagcca tatttcattt tgaatctgca gtggaaaaaa agcccacatt 1080 

tgaggtggct catctagacc tggcaagaat gtatatagaa gcaggcaatc acagaaaagc 1140 

tgaagagaat tttcaaaaat tgttatgcat gaaaccagtg gtagaagaaa caatgcaaga 1200 

10 catacatttc tactatggtc ggtttcagga atttcaaaag aaatctgacg tcaatgcaat 1260 

tatccattat ttaaaagcta taaaaataga acaggcatca ttaacaaggg ataaaagtat 132 0 

caattctttg aagaaattgg ttttaaggaa acttcggaga aaggcattag atctggaaag 13 80 

cttgagcctc cttgggttcg tctataaatt ggaaggaaat atgaatgaag ccctggagta 1440 

ctatgagcgg gccctgagac tggctgctga ctttgagaac tctgtgagac aaggtcctta 1500 

15 ggcacccaga tatcagccac tttcacattt catttcattt tatgctaaca tttactaatc 1560 

atcttttctg cttactgttt tcagaaacat tataattcac tgtaatgatg taattcttga 1620 

ataataaatc tgacaaaata tt 1642 



20 <210> 10 

<211> 853 

<212> DNA 

<213> Homo sapiens 
<220> 

25 <221> interferon-inducible protein 9-27 

<222> (1) . . (853) 



- 50- 

<223> GenBank accession No. J04164 
<400> 10 

ctagtcctga cttcacttct gatgaggaag cctctctcct tagccttcag cctttcctcc 60 
5 caccctgcca taagtaattt gatcctcaag aagttaaacc acacctcatt ggtccctggc 120 
taattcacca atttacaaac agcaggaaat agaaacttaa gagaaataca cacttctgag 180 
aaactgaaac gacaggggaa aggaggtctc actgagcacc gtcccagcat ccggacacca 240 
cagcggccct tcgctccacg cagaaaacca cacttctcaa accttcactc aacacttcct 300 
tccccaaagc cagaagatgc acaaggagga acatgaggtg gctgtgctgg gggcaccccc 3 60 

10 cagcaccatc cttccaaggt ccaccgtgat caacatccac agcgagacct ccgtgcccga 420 
ccatgtcgtc tggtccctgt tcaacaccct cttcttgaac tggtgctgtc tgggcttcat 480 
agcattcgcc tactccgtga agtctaggga caggaagatg gttggcgacg tgaccggggc 540 
ccaggcctat gcctccaccg ccaagtgcct gaacatctgg gccctgattc tgggcatcct 600 
catgaccatt ggattcatcc tgtcactggt attcggctct gtgacagtct accatattat 660 

15 gttacagata atacaggaaa aacggggtta ctagtagccg cccatagcct gcaacctttg 720 
cactccactg tgcaatgctg gccctgcacg ctggggctgt tgcccctgcc cccttggtcc 780 
tgcccctaga tacagcagtt tatacccaca cacctgtcta cagtgtcatt caataaagtg 840 
cacgtgcttg tga 853 

20 

<210> 11 
<211> 1377 
<212> DNA 
<213> Homo sapiens 
25 <220> 

<221> 2-5A synthetase induced by interferon 



-51 - 



<222> (1)..(1377) 
<223> X04371 

<400> 11 

5 aacgaaacca acagcagtcc aagctcagtc agcagaagag ataaaagcaa acaggtctgg 60 

gaggcagttc tgttgccact ctctctcctg tcaatgatgg atctcagaaa taccccagcc 120 

aaatctctgg acaagttcat tgaagactat ctcttgccag acacgtgttt ccgcatgcaa 180 

atcaaccatg ccattgacat catctgtggg ttcctgaagg aaaggtgctt ccgaggtagc 240 

tcctaccctg tgtgtgtgtc caaggtggta aagggtggct cctcaggcaa gggcaccacc 3 00 

10 ctcagaggcc gatctgacgc tgacctggtt gtcttcctca gtcctctcac cacttttcag 3 60 

gatcagttaa atcgccgggg agagttcatc caggaaatta ggagacagct ggaagcctgt 420 

caaagagaga gagcattttc cgtgaagttt gaggtccagg ctccacgctg gggcaacccc 480 

cgtgcgctca gcttcgtact gagttcgctc cagctcgggg agggggtgga gttcgatgtg 540 

ctgcctgcct ttgatgccct gggtcagttg actggcagct ataaacctaa cccccaaatc 600 

15 tatgtcaagc tcatcgagga gtgcaccgac ctgcagaaag agggcgagtt ctccacctgc 660 

ttcacagaac tacagagaga cttcctgaag cagcgcccca ccaagctcaa gagcctcatc 720 

cgcctagtca agcactggta ccaaaattgt aagaagaagc ttgggaagct gccacctcag 780 

tatgccctgg agctcctgac ggtctatgct tgggagcgag ggagcatgaa aacacatttc 840 

aacacagccc agggatttcg gacggtcttg gaattagtca taaactacca gcaactctgc 900 

20 atctactgga caaagtatta tgactttaaa aaccccatta ttgaaaagta cctgagaagg 960 

cagctcacga aacccacgcc tgtgatcctg gacccggcgg accctacagg aaacttgggt 1020 

ggtggagacc caaagcgttg gaggcagctg gcacaagagg ctgaggcctg gctgaattac 1080 

ccatgcttta agaattggga tgggtcccca gtgagctcct ggattctgct ggtgagacct 1140 

cctgcttcct ccctgccatt catccctgcc cctctccatg aagcttgaga catatagctg 1200 

25 gagaccattc tttccaaaga acttacctct tgccaaaggc catttatatt catatagtga 1260 

caggctgtgc tccatatttt acagtcattt tggtcacaat cgagggtttc tggaattttc 1320 



- 52- 

acatcccttg tccagaattc attcccctaa gagtaataat aaataatctc taacacc 



1377 



«210> 12 

5 <211> 5503 

<212> DNA 

<213> Homo sapiens 

<220> 

10 <221> Wilms' tumor gene 

<222> (1)..(5503) 

<223> GenBank accession No. X69950 
<220> 

15 <221> misc_f eature 

<222> (2041) . . (2041) 

<223> undefined nucleotide 

<220> 

20 <221> misc^feature 

<222> (2083) . . (2083) 

<223> undefined nucleotide 

<220> 

25 <221> misc_feature 

<222> (2186) . . (2186) 



<223> 



unde f ined nuc 1 eo t ide 



-53 



<220> 

<2 21> misc^feature 

5 <222> (2187) . . (2187) 

<223> undefined nucleotide 



<220> 

<2 21> misc_feature 

10 <222> (2488) . . (2488) 

<223> undefined nucleotide 



<220> 

<221> misc_feature 

15 <222> (3386) (3386) 

<223>. undefined nucleotide 

<220> 

<221> misc_feature 

20 <222> (3718) . . (3718) 

<223> undefined nucleotide 



<220> 

<2 21> misc_feature 

25 <222> (3720) . . (3720) 

<223> undefined nucleotide 



-54- 



<220> 

<221> misc_feature 

<222> (3938) . . (3938) 

5 <223> undefined nucleotide 

<400> 12 

ggtgcgctca ggactggaag caactctcag 
tctggccacg gcctggcctc catccccact 

10 atcctctata taattagttc cagaatttaa 
gccgcggaat cctttcactg aggactctcg 
cagaacagac gagtaacccc agcggctcca 
tttgcaccgc attcggcagt acctaacggt 
agttaggaca gtaagtgggg actggttaaa 

15 agaagctggg agtagttatc acaaagcccc 
gaatgacgtt gaaattttca ccgccagcgc 
tgcagccccg agtgggcaga ttccgaagag 
gttgggctgt cttgcgcgtg gggctcgggg 
agcttgttcc tctcccgcga caggcagcag 

20 cgggccccgc gcgcagtgtg gctgtccccg 
ttattcagtt gtccgcggcc gctgggtgac 
tccctggggt tgcggctgta gcagcccagc 
ccggggtgaa agtaagggtt gagaaatcct 
tttcatattg caataattat gcaccttcga 

25 caacgccaca tggttttcca ggggctgtcc 
gtagaggggt ggtgggtaaa acctccaact 



ttcatcaatt tcatgggcct acgtatctgc 60 

gtggcattaa tgcagaacaa attgagacat 120 

aaagatttac ggtataataa tagaggcagt 180 

gggagctcgg ggaccatctc gcggtggcat 240 

ggcactgcga actgggggag agggggaaat 300 

tccgggctcc cagatagggt cagcgaggag 3 60 

aaaaatcgac ttaattttga aatgatcaga 420 

tcccttgccc cccccttttt ttaattgaag 480 

ggggctggga accatcctaa acacttgccc 540 

ttggagtcgg cggggagggg aggaacagct 600 

cgcgggcccc ggggagaggc gctagtcgaa 660 

cgaggtcgag ccactcttta ttacggcctg 720 

cccttgaccg accgcagcgc tttggggtgt 780 

tcccggaggc ggcccaggcg tgcgcactgg 840 

tccgcgctct gtcagatgca gtggacagcg 900 

cactgccctg ctcctaccca gcctcgattt 960 

ggccgggatg cctgcgcttt gccagtgtat 1020 

tcgcctctgc atcccattag ctctgaagag 1080 

ggacgttgag agacccgggt ttcagctcgg 1140 



cgttgccact aacttgtttc ctgtccttaa 
attgagtcaa aggcccacac tgctctccta 
gcgctagtat cccattatcc cgtttgacca 
gaactcgtct agggtcacac agctgttggt 
5 gccaccaacc ccttgctctt tacgtgcgct 
gcctcagttt ccttctctgt aaaacgagaa 
tggttcagac caggaacctc tgctacagat 
gaaagaaggg ggcgcccagt tttccacagc 
ggcctcgcga gctaaaggag gtacaggaga 
10 tacctgcctt cgtgctaggc tgtgagtcct 
tagctggcat gtcccccttg gaaaacctca 
aagaagagga cagagaggca tggagcgccc 
cgctggctta gtgtcgcctg ggcttccggg 
ttcctgtcgg gtccctgggg tcctccgact 
15 gccccgcagg ctgcagggaa ctcctcccac 
-agggggcacc ccggggttcg caccggtgct 
agaaaataaa tggcagagga gagaggagtt 
tccaccccta catccctcac cccggnncaa 
tttacgtgtc tttgccttac ctgggtttca 
20 tcccttcaga taacctcctc tccccccgct 
aatctcaatc ttttcgtttg gaaatgaaag 
cctcacagtt gagtcctagt taagaaaaac 
aaggaaggga gatacaggga tctggggngt 
cgagagtcct ttctccactc aaaaaaccaa 
25 agatcgtgtg ggagaggact gagcagagag 
tttcttaata ataattttaa aagcttctga 
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acagtagtag taacaaaaac tgtaaagttg 12 00 

tctaatcctc agaacaactc tatgaggtgg 12 60 

gtaaggaaac tgaggctcag agagactgcg 1320 

tacctggcag agctgaggtt ccattctggg 1380 

ccatagactg cctctccttc ggctgtctgg 1440 

gtttaatctg cagctttctt tgccaatcgg 1500 

ctcggcgctg acaggggtaa gaaggtgggt 1560 

ctgcttttct ctgggacctc gcgaagggcg 1620 

gcgcctatcg tccgcggcgg gtgaaggtgc 1680 

ggtgcttagc tcagggcgcc aaggccagtg 1740 

ggtctcccgc agagaacgtt acccacaaca 1800 

tgcgactgca ggagtacgtc agttccccag 1860 

catgtggatc cgttggggtc gtacggagac 1920 

gcggctcctc agcttagcac tttcttcttg 1980 

ctctttagtc ggagaagtcc aagtcgggcg 2040 

cttcccctcc ccncccccac aaggattctg 2100 

ctacatttgc ttggctctcc tttcctccta 2160 

aaacttattt ttgaaaaatg ttggcagaga 2220 

caaacacaac gactcacatt caagccagcc 2280 

aaaagtgcca aggatggtaa aagaagaaac 2340 

tccccggctt ttcataaagg gctcctcgcc 2400 

gacttccaag tagaaataat aggcggggag 2460 

tcttagggca actggcagtg aattttgtct 2520 

acgcgcgagc cccgcgaaag gtttagggat 2580 

cgtgggggca gtgtcttgta gaatctttct 2640 

gtggagacga cgcaaagtca agcagcaaag 2700 



gtggcctggg 
gcctacggga 
acttcgggtc 
caaagcgtaa 
5 actttgcttc 
caggaggccg 
gccgtcgctg 
gctccggccc 
ttagaagatc 

10 gcgcgggggc 
gctcccaaag 
actccttggt 
tatcctcgac 
ctgccgggct 

15 atgctccggc 
aaactagccg 
ggaaactaag 
aaacctcacc 
aactcccggc 

20 agggccacct 
agaagcaaga 
tgagttcttt 
ccaaaccact 



aggcaagcgg 
tgcgcattcc 
cgcagccaga 
aaaattacta 
acataagttt 
gactgcaacc 
ctgcctcctg 
ccggttcttc 



agggctcaag 
caagaagtgc 
atttaatggc 
tgtcatttat 
ggctggaaag 
ggttgcttcc 
gcgcccctgg 
tccttgccta 



caaaaaccaa accaaaacac 



tgccaaacag 
attcgaacac 
ccccttaact 
cagggccaca 
tcgtatccaa 
cggaatatac 
ctgttttccc 
tgctgctgac 
aagtaaacaa 
cacccaactg 
ctttccccag 
gccagactca 
ctgcgctttc 
cttttagatt 



agactagacg 
taactcgcgc 
ggcttncgcc 
ggcagtgctc 
accctcccct 
gcaggctttg 
ggcttaaccg 
tccaatttta 
ctactagccg 
agggacgttc 
tgaccccaag 
agggtgcaaa 
ctgaagttcc 
aacaacccca 
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tgccgcatct 

gcccttcgag 

gacaacgttt 

tgaaacgeca 

cttgcagccc 

ctcccgtcgc 

gattttatac 

ggggttgttt 

cccctacccg 

aagggagtca 

ccgtgggccg 

tcctggtcaa 

ggcggagtgg 

tcacccctcc 

ggcgtttgcc 

tagaagaatt 

ggtaggcggc 

atcgaaatac 

gctttcagtc 

atcatggcca 

gcaagggnat 

cgccctcttg. 

tctctactcc 



ttaccctcag 
taagtcctgg 
atgcaatgca 
ttctttgtca 
cagcccgggc 
gcctggccgt 
gcacctctga 
cccaatagat 
ccccaaacac 
gatttagcga 
atggaggttc 
tcactgagca 
ctccaggagt 
tccccaaact 
caagggtttt 
agatattcct 
aaccgccttc 
gcccggctta 
ccgacctctg 
ctcccctacc 
acgcttcttt 
gagcctacct 
caccgcattc 



ggtctcctgc 
gcccgcacac 
agctaaaaac 
aactgcaact 
cagccaggta 
cccacgctgc 
aacacgctcc 
actgactcct 
ctgctctggg 
agctcttcga 
tccctactcc 
accagaatgg 
tacccgctcc 
gggcgccagg 
cttccctcct 
cactggaaag 
cgcctggngn 
taactggtgc 
gaacccacaa 
cgacagttct 
gaagcttgac 
gcccctccct 
gaccctgccc 



ggactcactg cttacctgaa cgactctcca gtgagacgag gctcccacac tggcgaaggc 
25 caagaagggg aggtgggggg agggttgtgc cacaccggcc agctgagagc gcgtgttggg 
ttgaagagga gggtgtctcc gagagggacg ctccctcgga cccgccctca ccccagctgc 



2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
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gagggcgccc 


CCaayyayCa 


yeyeyeyeey 


rrt" crncccicici 
ee i-yy^^yyy 


c 1 1 ooac t crc 


taacrtcraatcr 


4320 




gagcggccga 


gcc tec tggc 


♦* *^ 4— ^ ^» 4— ft 

ccctcccccc 


ccccgcgccg 


/-» »-» f~rf~% c* t - 

LLyytLtuLL 


t - 1* a t" t" t~ fir 


T J o u 




tttgggaagc 


cgagggcagc 


caggcagc eg 


yyy Laayyay 


l. Leaay y^oy 


ccrcccacacc 


4440 




cgggggctct 


ccgcaacccg 


aCCgCC Cytc 


eye e e e e <— e cx 


t" t" cccctccc 


t - ccctLcccaiC 

l~ V V_-» **w- >^ 


4500 


5 


ctactcattc 


acccacccac 


ccacccay ag 


ccyyyacgyc 


cxy e^eoyyey 




4560 




gccgtctcct 


cgccgcgatc 


c tggac t tec 


4— 4— 4— y^r f*v 4— j-r f-t 

CCuLgCLyCa 


yyacccgyL. t. 


\-ccacci\~Cf\~Ci 


4620 




tcccggagcc 


ggcgtctcag 


cacacgctcc 


gc tccgggcc 


u y y g l gc c c a. 


cayLayuLay 


*± o o v 




agcagcaggg 


agtcegggae 


ccgggcggca 


tc tgggccaa 


yLLayyCyCC 




4740 




gcgctgaacg 


tctccagggc 


eggaggagee 


gcggggcgtc 


*-+ r*r /~r r*t ♦- 4~ /->r 24 /T 

Cyyy tc tgay 


Lw. Leay eaaa 


4800 


10 


tgggctccga 


cgtgcgggac 


c tgaacgege 


t get gcc cgc 


Cy uccccucc 


ct.ggyi.ygey 


48 60 




gcggcggctg 


tgccctgcct 


gtgagcggcg 


eggegcagtg 


ggcgccggcg 


ctyyatLLLy 


^± Z7 \J 




cgcccccggg 


cgcttcggct 


tacgggtcgt 


tgggcggccc 


cgcgccgcca 


cegge tccge 






cgccaccccc 


gccgccgccg 


cctcactcc t 


tcatcaaaca 


ggagecgage 


t-ggggeggeg 






cggagccgca 


cgaggagcag 


tgcctgagcg 


ccttcactgc 


ccacctttcc 


yyLLdy l tea 


5100 


15 


ctggcacagc 


eggagectgt 


cgc taeggge 


j^l /-t y-r 

CCtCCgytCC 


LLC LLLytLt 


ayeeayy *~ 


5160 




catccggcca ggccaggatg 


t ttcctaacg 


cgccc tacc t 


yCCCayC tyc 


/>-» f- /^/»f a ClZk CtC C 

e Leyayay 


5220 




agcccgctat 


tegcaatcag 


ggtaagtagg 


ccggggagcg 


#— • -3 f~T I—+ 

CCCCC UciC-.y v- 


rrcnf^rrrrCri Cf t* 

y*-yyyy v * a y ^** 


5280 




ggcgccaggg 


actctccgct 


ctaggacacc 


cccctctcct 


accccttttg 


accgcagctc 


5340 




ttacccagct gcttcccaag 


ggccgtgagg 


atageggaag 


eggeggctgg 


ggaggaggee 


5400 


20 


ggagagtggg 


agtgcacgca 


ggcactggcc 


cccgacatcc 


tccaaagcca 


ggcagagcta 


5460 




ggagcctgac 


tgttcgcaag 


ageegggagg 


gcgtctgggg 


ccc 




5503 



<210> 13 
25 <211> 1816 
<212> DNA 
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<213> Homo sapiens 
<220> 

<221> interferon regulatory factor 7B_IRF-7 

<222> (1) . . (1816) 

5 <223> GenBank accession No. U53831 

<400> 13 

ggcacccagg gtccggcctg cgccttcccg ccaggcctgg acactggttc aacacctgtg 60 

acttcatgtg tgcgcgccgg ccacacctgc agtcacacct gtagccccct ctgccaagag 120 

10 atccataccg aggcagcgtc ggtggctaca agccctcagt ccacacctgt ggacacctgt 180 

gacacctggc cacacgacct gtggccgcgg cctggcgtct gctgcgacag gagcccttac 240 

ctcccctgtt ataacacctg accgccacct aactgcccct gcagaaggag caatggcctt 300 

ggctcctgag agggcagccc cacgcgtgct gttcggagag tggctccttg gagagatcag 3 60 

cagcggctgc tatgaggggc tgcagtggct ggacgaggcc cgcacctgtt tccgcgtgcc 420 

15 ctggaagcac ttcgcgcgca aggacctgag cgaggccgac gcgcgcatct tcaaggcctg 



480 



ggctgtggcc cgcggcaggt ggccgcctag cagcagggga ggtggcccgc cccccgaggc 540 

tgagactgcg gagcgcgccg gctggaaaac caacttccgc tgcgcactgc gcagcacgcg 600 

tcgcttcgtg atgctgcggg ataactcggg ggacccggcc gacccgcaca aggtgtacgc 660 

gctcagccgg gagctgtgct ggcgagaagg cccaggcacg gaccagactg aggcagaggc 720 

20 ccccgcagct gtcccaccac cacagggtgg gcccccaggg ccattcttgg cacacacaca 780 

tgctggactc caagccccag gccccctccc tgccccagct ggtgacaagg gggacctcct 840 
gctccaggca gtgcaacaga gctgcctggc agaccatctg ctgacagcgt catggggggc 
agatccagtc ccaaccaagg ctcctggaga gggacaagaa gggcttcccc tgactggggc 

ctgtgctgga ggcgaggccg cggccccaga gtccccgcac caggcagagc cgtacctgtc 1020 

25 accctcccca agcgcctgca ccgcggtgca agagcccagc ccaggggcgc tggacgtgac 1080 

catcatgtac aagggccgca cggtgctgca gaaggtggtg ggacacccga gctgcacgtt 1140 



900 
960 
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cctatacggc cccccagacc cagctgtccg ggccacagac ccccagcagg tagcattccc 12 00 

cagccctgcc gagctcccgg accagaagca gctgcgctac acggaggaac tgctgcggca 12 60 

cgtggcccct gggttgcacc tggagcttcg ggggccacag ctgtgggccc ggcgcatggg 1320 

caagtgcaag gtgtactggg aggtgggcgg acccccaggc tccgccagcc cctccacccc 13 80 

5 agcctgcctg ctgcctcgga actgtgacac ccccatcttc gacttcagag tcttcttcca 1440 

agagctggtg gaattccggg cacggcagcg ccgtggctcc ccacgctata ccatctacct 1500 

gggcttcggg caggacctgt cagctgggag gcccaaggag aagagcctgg tcctggtgaa 15 60 

gctggaaccc tggctgtgcc gagtgcacct agagggcacg cagcgtgagg gtgtgtcttc 1620 

cctggatagc agcagcctca gcctctgcct gtccagcgcc aacagcctct atgacgacat 1680 

10 cgagtgcttc cttatggagc tggagcagcc cgcctagaac ccagtctaat gagaactcca 1740 

gaaagctgga gcagcccacc tagagctggc cgcggccgcc cagtctaata aaaagaactc 1800 

cagaacaaaa aaaaaa 1816 



15 <210> 14 

<211> 2056 

<212> DNA 

<213> Homo sapiens 

<220> . 

20 <221> Interferon-induced 60 kDa protein (CIG49) 
<222>- (1) . . (2056) 

<223> GenBank accession No. AF026939 
<400> 14 

gtggaaacct cttcagcatt tgcttggaat cagtaagcta aaaacaaaat caaccgggac 
25 cccagctttt cagaactgca gggaaacagc catcatgagt gaggtcacca agaattccct 
ggagaaaatc ctcccacagc tgaaatgcca tttcacctgg aacttattca aggaagacag 



tgtctcaagg gatctagaag atagagtgtg 
caaagctaca atgtacaact tgttggccta 
agccctggaa tgcttacggc aagctgaaga 
agaaatcaga agtctagtca cttggggaaa 
5 actctcagat gctcagattt atgtagataa 
tccatacagt attgagtatt ctgaacttga 
tggaagaaat gaaagggcga aggtgtgttt 
cccagaattc tcctctggac tggcaattgc 
acagttctct actgatgttt tgaagcaggc 

10 caaggttctc ttgggcctga aactgcagaa 
tgttgaagaa gccttggaaa agtctccttg 
attttacaga agaaaaggtg acctagacaa 
atccacacca aacaatggct acctctatca 
aagacaaatg cagaatacag gagaatctga 

15 actaaagcaa tatgctatgg actattcgaa 
gaatgcatac tccgatctcg ctgagttcct 
taaggaagtc cctgatgctg aaaagcaaca 
atataatggg aagtctgaag acactgctgt 
caaaaaatca actgacaagg aagagatcaa 

20 gcttccacaa aatgcaccaa attattggta 
agatctgctg caagcagcca aatgttatga 
cccttcaggc ataggcagta ttttcctgtc 
aatgggccag ggcgcagtca gctccagtcc 
gaactgagac agaggaggaa aacagagcat 

25 aggaggatag gaagacaggg ggccccaacc 
ttgctctaag gtacattttt aaagagttgt 
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taaccagatt gaatttttaa acactgagtt 240 

cataaaacac ctagatggta acaacgaggc 3 00 

gttaatccag caagaacatg ctgaccaagc 3 60 

ctacgcctgg gtctactatc acttgggcag 420 

ggtgaaacaa acctgcaaga aattttcaaa 480 

ctgtgaggaa gggtggacac aactgaagtg 540 

tgagaaggct ctggaagaaa agcccaacaa 600 

gatgtaccat ctggataatc acccagagaa 6 60 

cattgagctg agtcctgata accaatacgt 720 

gatgaataaa gaagctgaag gagagcagtt 780 

ccaaacagat gtcctccgca gtgcagccaa 840 

agctattgaa ctgtttcaac gggtgttgga 900 

ccagattggg tgctgctaca aggcaaaagt 960 

agctagtgga aataaagaga tgattgaagc 1020 

taaagctctt gagaagggac tgaatcctct 108 0 

ggagacggaa tgttatcaga caccattcaa 1140 

atcccatcag cgctactgca accttcagaa 1200 

gcaacatggt ttagagggtt tgtccataag 1260 

agaccaacca cagaatgtat ccgaaaatct 132 0 

tcttcaagga ttaattcata agcagaatgg 1380 

gaaggaactg ggccgcctgc taagggatgc 1440 

agcatctgag cttgaggatg gtagtgagga 1500 

cagagagctc ctctctaact cagagcaact 1560 

cagaagcctg cagtggtggt tgtgacgggt 1620 

tgggattgct gagcagggaa gctttgcatg 1680 

tttttggccg ggcgcagtgg ctcatgcctg 1740 
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taatcccagc actttgggag gccgaggtgg gcggatcacg aggtctggag tttgagacca 1800 

tcctggctaa cacagtgaaa tcccgtctct actaaaaata caaaaaatta gccaggcgtg 1860 

gtggctggca cctgtagtcc cagctacttg ggaggctgag gcaggagaat ggcgtgaacc 1920 

tggaaggaag aggttgcagt gagccaagat tgcgcccctg cactccagcc tgggcaacag 1980 

5 agcaagactc ggaattcctg cagcccgggg gatccactat tctagagcgc cgcaacggcc 2040 

gtggagtcca gagatg 2056 



<210> 15 

10 <211> 1879 

<212> DNA 

<213> Homo sapiens 
<220> . 

<221> nuclear autoantigen (SP-100) 

15 <222> (1)..(1879) 

<223> GenBank accession No. M60618 



<400> 15 





ctgaggccca 


cgcagggcct 


agggtgggaa 


gatggcaggt 


gggggcggcg 


acctgagcac 


60 


20 


caggaggctg 


aatgaatgta 


tttcaccagt 


agcaaatgag 


atgaaccatc 


ttcctgcaca 


120 




cagccacgat 


ttgcaaagga 


tgttcacgga 


agaccagggt 


gtagatgaca 


ggctgctcta 


180 




tgacattgta 


ttcaagcact 


tcaaaagaaa 


taaggtggag 


atttcaaatg 


caataaaaaa 


240 




gacatttcca 


ttcctcgagg 


gcctccgtga 


tcgtgatctc 


atcacaaata 


aaatgtttga 


300 




agattctcaa 


gattcttgta 


gaaacctggt 


ccctgtacag 


agagtggtgt 


acaatgttct 


360 


25 


tagtgaactg 


gagaagacat 


ttaacctgcc 


agttctggaa 


gcactgttca 


gcgatgtcaa 


420 




catgcaggaa 


taccccgatt 


taattcacat 


ttataaaggc 


tttgaaaatg 


taatccatga 


480 



i 
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caaattgcct ctccaagaaa gtgaagaaga agagagggag gagaggtctg gcctccaact 540 

aagtcttgaa caaggaactg gtgaaaactc ttttcgaagc ctgacttggc caccttcggg 600 

ttccccatct catgctggta caaccccacc tgaaaatgga ctctcagagc acccctgtga 660 

aacagaacag ataaatgcaa agagaaaaga tacaaccagt gacaaagatg attcgctagg 720 

5 aagccaacaa acaaatgaac aatgtgctca aaaggctgag ccaacagagt cctgcgaaca 780 

aattgctgtc caagtgaata atggggatgc tggaagggag atgccctgcc cgttgccctg 840 

tgatgaagaa agcccagagg cagagctaca caaccatgga atccaaatta attcctgttc 900 

tgtgcgactg gtggatataa aaaaggaaaa gccattttct aattcaaaag ttgagtgcca 960 

agcccaagca agaactcatc ataaccaggc atctgacata atagtcatca gcagtgagga 1020 

10 ctctgaagga tccactgacg ttgatgagcc cttagaagtc ttcatctcag caccgagaag 1080 

tgagcctgtg atcaataatg acaacccttt agaatcaaat gatgaaaagg agggccaaga 1140 

agccacttgc tcacgacccc agattgtacc agagcccatg gatttcagaa aattatctac 12 00 

attcagagaa agttttaaga aaagagtgat aggacaagac cacgactttt cagaatccag 1260 

tgaggaggag gcgcccgcag aagcctcaag cggggcactg agaagcaagc atggtgagaa 1320 

15 ggctcctatg acttctagaa gtacatctac ttggagaata cccagcagga agagacgttt 1380 

cagcagtagt gacttttcag acctgagtaa tggagaagag cttcaggaaa cctgcagctc 1440 

atccctaaga agagggtcag gtaaagaaga ttaggatgcc aagacttggc ctgcagaatg 1500 

tcaggaatgt gaattaaaag ctgctgtttc cagacgcttt ttattctgag caccttcact 1560 

accttgtatc cagttcatct gggaactcct ttttgcattt tagaaaatgg aaagaggcag 1620 

20 gaaattatga taaactcatg tttaacagaa agagtttcac tgactaaatg tatgtaatta 1680 

tattttgttg ttgtagaaga aataaatagc aaatttgtgg tattcttttt tttaaacctg 1740 

ctctcattcc tattaacact aagatcttag atttttatag tgataaatgg gttgacatca 1800 

ttgtcgtttg taattgtaaa gcctcaaaag acaactgttc ctactatgta attatagaca 1860 

gaaataaaaa cttcagatc 1879 

25 
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<210> 16 

<211> 3997 

<212> DNA 

<213> Homo sapiens 
5 <220> 

<221> KIAA0793 

<222> (1) . . (3997) 

<223> GenBank accession No. AB018336 



10 <400> 16 





ggcgacgccg 


acgcgagtct 


ggcggctgct 


gcttgcgact 


gcggaggccg 


ggcgaggccg 


60 




gtgaggacgc 


ggcgggcgag 


cgagaggccg 


aggtgttttc 


ttcactcatg 


gtgaagaatg 


120 




ggggagatag 


aaggaacata 


cagagtcctg 


cagactgcag 


ggatgcgctt 


gggtgcccag 


180 




acccctgtgg 


gagttagcac 


ccttgagcct 


gggcagactc 


tcttgcccag 


aatgcaagag 


240 


15 


aagcacctgc 


acctcagagt 


aaagctgctg 


gacaacacca 


tggaaatatt 


tgacattgag 


300 




cctaaatgcg 


atggccaggt 


attactgaca 


caagtgtgga 


agcgtttaaa 


cctggtagaa 


360 




tgtgactact 


tcgggatgga 


gtttcaaaat 


actcagtcct 


actggatttg 


gcttgaacct 


420 




atgaaaccca 


tcattaggca 


aatacgaagg 


ccaaagaatg 


tggtgcttcg 


cctagctgta 


480 




aaattttttc 


cacctgatcc 


tggtcagcta 


caagaagaat 


atacaagata 


cttgtttgcc 


540 


20 


ttgcaactta 


agagagacct 


gctggaagag 


cgtttgacct 


gtgctgacac 


cacagcggcc 


600 




cttctcacg : t 


cccatctcct 


gcagtcggaa 


ataggagatt 


acgatgaaac 


gctggaccga 


660 




gagcacctca 


aagtgaacga 


gtatttgcct 


ggccagcagc 


actgccttga 


gaagatacta 


720 




gaattccatc 


agaagcacgt 


gggccagaca 


cctgctgagt 


cggatttcca 


ggtgctcgaa 


780 




attgctcgaa 


agttggaaat 


gtacggcatc 


agatttcaca 


tggcttctga 


cagggaagga 


840 


25 


accaagattc 


aactggcagt 


ttcccacatg 


ggtgtactcg 


tgttccaggg 


caccaccaaa 


900 




atcaacactt 


tcaactggtc 


caaggtccgt 


aaactaagct 


tcaagaggaa 


aagatttctt 


960 



atcaaacttc atccagaggt 
agtagagatg aatgtaagaa 
cttttggacc aacctaagcc 
agatacagtg gaagaactca 
5 agaattccat atgaaagaag 
gacctaccaa aacagagcat 
tcagcgaatg ccttttactc 
tttaaggaca gcagcagctc 
gcagagaggc gcagtggagc 

10 gggccccccg cactccagcc 
tccagccgga agagccccct 
gaacagggct catccccact 
gactgcgagg agcccagaca 
gagattctcg ctacagaacg 

15 cgcagcgcag tggtgaagga 
aacatcgatc ccatctatga 
gcactctggg aagggccctc 
atcctgctca ggaacatgcg 
gaggtcctaa cagaactgga 

20 aaggagtttg agctgcagaa 
atccagcggc tgctgcacta 
gggcaccatg actacgctga 
acactacagc acattctcat 
gacctggtgg gcatagagaa 

25 cttcacaagc tcaccaagaa 
ttgctgtaca caagcaaagg 
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tcatggacct taccaggaca 
cttctggaag atttgtgtgg 
aaaagcaaaa gccgtcttct 
gaaacaacta gtagattatt 
gcacagcaag acccacacgt 
ctcattcccc gagggattga 
gctctctccc tccactctgg 
cctcacagat ccccaggttt 
agtggctgga ggccccgaca 
tggtccaggc ctttccacga 
gagtctgagc cctgcatttc 
cctgagccct gtcctcagtg 
caagcgcgtg cctgcagacg 
aacatacctc aaggatttag 
ggacgccatg cctgcgactc 
gttccacaga ggcttcctgc 
caaagcccac acaaaaggca 
ccagttaaag gagtttacca 
aaaggctacc aaacgctgta 
ggtctgctac ttgcctctca 
ccgcctgctg ctgcgccgcc 
ctgccatgac gccctgaaag 
ccggctggag aacctgcaga 
cctcattgct cctggcaggg 
gggcctgcag cagaggatgt 
agttgcaggg accagccact 



cattagaatt tttgttgggt 1020 

agtatcacac cttttctaga 1080 

tcagccgggg ctcctccttc 1140 

tcaaagacag tggaatgaag 12 00 

ccgttcgagc tctgactgca 1260 

ggactcctgc ctccccatct 1320 

tcccctctgg cctgccagag 1380 

cctacgtcaa gagtccagct 1440 

caccatcggc ccagcccctc 1500 

agagtcctca gccttctccc 1560 

aggtgccttt gggcccagct 1620 

atgctggcgg agccgggatg 1680 

aggcctactt catagtcaaa 1740 

aagttattac cgtgtggttc 1800 

tgatgacgct gctcttctcc 1860 

gcgaggtgga gcagaggctg 192 0 

gtcatcaacg aatcggggac 1980 

gctacttcca aagacatgac 2040 

agaagttgga ggcagtgtac 2100 

acacgttcct gctgaagccc 2160 

tatgcggaca ttacagcccc 2220 

ccatcacaga ggtgaccacc 2280 

agctaacgga gctgcagcgg 2340 

agttcatccg tgagggctgc 2400 

tttttctgtt ctcagatatg 2460 

tccggatccg gggcctcctt 2520 



cccctccaag gcatgctggt ggaagaaagt 
accatctacg cggctcagaa aacaatcgtg 
aagtggatgc tggacctgaa ctccgcgatc 
cctgcactgc caggccgcac tgtgtgcact 

5 ctggagcagg agtcagaaga tgatgctcgg 
cagcaccggg ccaacaccac aatgcacgtg 
gcagaccaca gtgcagctgt cgagaaccag 
aacagtcatg gctggcagaa gctctgggtc 
aaaactcatc aggatgacta cccactggcc 

10 atccccaggg aggccgatgg catacacaaa 
cacgtctact tcttccgggc tgagagcaag 
cagggggcca gcagctcagc cgggagggcc 
tcctcagggc tggaggggat ggtcaggggg 
ttggacacaa ctacaaagaa cagcaggaca 

15 acagatggga agtggctgtg gtctcactgg 
cctcatgtaa catctgggag gggcttcatc 
.gctatctggg gccctgggaa aaatgtgcga 
ctcccaggcc cctggctcaa agacgcagac 
caaagcctgg gcacaccctg cctctctccc 

20 agagcaagca ctggaaaaga ggccctgcca 
ctcagacaag cgtggcacag ccatgctgac 
aggggcctgc cctgcaggtc acagctaaac 
tgtacctcta gagaagcaga aaccaaagtc 
aatttattac tgcccagcat tccttccaac 

25 acacatttga ttaaaaataa acaaacagca 
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gataacgagt ggtctgttcc acactgtttc 2580 

gtggcagcca gcactcggct ggagaaagag 2640 

caagcagcca agagtggcgg tgacacggcc 2700 

cgtcccccca gatcccecaa cgaggtatct 2760 

ggtgtccgca gctccctgga ggggcatggc 2820 

tgctggtacc ggaacaccag cgtgtccagg 2880 

ctttcaggat atctgctaag aaagttcaaa 2940 

gtctttacca acttctgttt gttcttctac 3000 

agcctcccgc tgctgggcta cagcgtgagc 3060 

gactatgttt tcaagctcca gttcaaatcc 3120 

tacacatttg aaaggtggat ggaggtgatc 3180 

ccaagcatcg tgcaggatgg cccccaaccc 3240 

aaggaggaat gacgctcaac ctgcccaggt 3300 

cagaggtgac ctctgtcctg aggcttctca 3360 

atccccactg gcaccagcag tgtgggtggg 3420 

cccccaccca ggacctagtg catgccagca 3480 

gtcttgagcg cggagccgct caagccacag 3540 

aaggcctgag cagtgctctc ggcatcggac 3600 

cagagcaggg tccctgccga ggactggcct 3 660 

tacaccctgc gtacccactg ccaggaccct 372 0 

cttccatctg gtgaaccaag tggcagcccc 3780 

aagtctggca gaagccacgc ttgttcccca 3840 

cccctgtgcc ctgggagggt ggggccgtct 3900 

gggaagtaga tgggcgactg ctttgttcac 3960 

tctcccc 3997 



! 
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<210> 17 
<211> 2905 
<212> DNA 
5 <213> Homo sapiens 
<220> 

<221> 2 '5' oligoadenylate synthetase (p69 2-5A synthetase) 
<222> (1) . . (2905) 

<223> GenBank accession No. M87434 

10 

<400> 17 

cggcagccag ctgagagcaa tgggaaatgg ggagtcccag ctgtcctcgg tgcctgctca 60 

gaagctgggt tggtttatcc aggaatacct gaagccctac gaagaatgtc agacactgat 12 0 

cgacgagatg gtgaacacca tctgtgacgt ctgcaggaac cccgaacagt tccccctggt 180 

15 gcagggagtg gccataggtg gctcctatgg acggaaaaca gtcttaagag gcaactccga 4 240 

tggtaccctt gtccttttct tcagtgactt aaaacaattc caggatcaga agagaagcca 300 

acgtgacatc ctcgataaaa ctggggataa gctgaagttc tgtctgttca cgaagtggtt 3 60 

gaaaaacaat ttcgagatcc agaagtccct tgatgggtcc accatccagg tgttcacaaa 42 0 

aaatcagaga atctctttcg aggtgctggc cgccttcaac gctctgagct taaatgataa 480 

20 tcccagcccc tggatctatc gagagctcaa aagatccttg gataagacaa atgccagtcc 540 

tggtgagttt gcagtctgct tcactgaact ccagcagaag ttttttgaca accgtcctgg 600 

aaaactaaag gatttgatcc tcttgataaa gcactggcat caacagtgcc agaaaaaaat 660 

caaggattta ccctcgctgt ctccgtatgc cctggagctg cttacggtgt atgcctggga 720 

acaggggtgc agaaaagaca actttgacat tgctgaaggc gtcagaacgg ttctggagct 780 

25 gatcaaatgc caggagaagc tgtgtatcta ttggatggtc aactacaact ttgaagatga 84 0 

gaccatcagg aacatcctgc tgcaccagct ccaatcagcg aggccagtaa tcttggatcc 900 
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agttgaccca accaataatg tgagtggaga taaaatatgc tggcaatggc tgaaaaaaga 960 

agctcaaacc tggttgactt ctcccaacct ggataatgag ttacctgcac catcttggaa 1020 

tgtcctgcct gcaccactct tcacgacccc aggccacctt ctggataagt tcatcaagga 1080 

gtttctccag cccaacaaat gcttcctaga gcagattgac agtgctgtta acatcatccg 1140 

5 tacattcctt aaagaaaact gcttccgaca atcaacagcc aagatccaga ttgtccgggg 12 00 

aggatcaacc gccaaaggca cagctctgaa gactggctct gatgccgatc tcgtcgtgtt 1260 

ccataactca cttaaaagct acacctccca aaaaaacgag cggcacaaaa tcgtcaagga 1320 

aatccatgaa cagctgaaag ccttttggag ggagaaggag gaggagcttg aagtcagctt 1380 

tgagcctccc aagtggaagg ctcccagggt gctgagcttc tctctgaaat ccaaagtcct 1440 

10 caacgaaagt gtcagctttg atgtgcttcc tgcctttaat gcactgggtc agctgagttc 1500 

tggctccaca cccagccccg aggtttatgc agggctcatt gatctgtata aatcctcgga 1560 

cctcccggga ggagagtttt ctacctgttt cacagtcctg cagcgaaact tcattcgctc 1620 

ccggcccacc aaactaaagg atttaattcg cctggtgaag cactggtaca aagagtgtga 1680 

aaggaaactg aagccaaagg ggtctttgcc cccaaagtat gccttggagc tgctcaccat 1740 

15 ctatgcctgg gagcagggga gtggagtgcc ggattttgac actgcagaag gtttccggac 1800 

agtcctggag ctggtcacac aatatcagca gctcggcatc ttctggaagg tcaattacaa 18 60 

ctttgaagat gagaccgtga ggaagtttct actgagccag ttgcagaaaa ccaggcctgt 1920 

gatcttggac ccaggcgaac ccacaggtga cgtgggtgga ggggaccgtt ggtgttggca 1980 

tcttctggac aaagaagcaa aggttaggtt atcctctccc tgcttcaagg atgggactgg 2040 

20 aaacccaata ccaccttgga aagtgccgac aatgcagaca ccaggaagtt gtggagctag 2100 

gatccatcct attgtcaatg agatgttctc atccagaagc catagaatcc tgaataataa 2160 

ttctaaaaga aacttctgga gatcatctgg caatcgcttt taaagactcg gctcaccgtg 2220 

agaaagagtc actcacatcc attcttccct tgatggtccc tattcctcct tcccttgcct 2280 

tcttggactt cttgaaatca atcaagactg caaacccttt cataaagctg ccttgctgaa 2340 

25 ctcctctctg caggagccct gcttaaaata gttgatgtca tcactttatg tgcatcttat 2400 

ttctgtcaac ttgtattttt ttttcttgta tttttccaat tagctcctcc tttttccttc 2460 
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cagtctaaaa aaggaatcct ctgtgtcttc aaagcaaagc tctttacttt ccccttggtt 2520 

ctcataactc tgtgatcttg ctctcggtgc ttccaactca tccacgtcct gtctgtttcc 2580 

tctgtataca aaaccctttc tgcccctgct gacacagaca tcctctatgc cagcagccag 2640 

gccaaccctt tcattagaac ttcaagctct ccaaaggctc agattataac tgttgtcata 2700 

5 tttatatgag gctgttgtct tttccttctg agcctgcctt tatcccccca cccaggagta 2760 

tcctcttgcc aaagcaaaag actttttcct tggctttagc cttaaagata cttgaaggtc 2820 

taggtgcttt aacctcacat accctcactt aaacttttat cactgttgca tataccagtt 2880 

gtgatacaat aaagaatgta tctgg 2905 



10 

<210> 18 

<211> 840 

<212> DNA 

<213> Homo sapiens 
15 <220> 

<221> histone H2A 

<222> (1) . . (840) 

<223> GenBank accession No. Z80776 



20 <400> 18 

ggatgaattg tctctgtgtg ccctgtggta tgcaaattag gtgattctag cgtttctttg 60 

caattggttg gttgttctag ccaatcaaat agcgttattt acaattcgac ctccaagtat 120 

aatagtttct cacttgctaa caaaactcac ttttzacattt ttgtcttcat tgcttaacat 180 

cgtttttaag aattcaaaat gtccggacgc ggcaagcaag gcggaaaggc ccgagctaag 240 

25 gctaagaccc gctcttcgcg ggccggactc cagttccctg tgggccgcgt acaccgcttg 300 

ctccgcaagg gcaactactc cgagcgagtc ggggccggcg cgccagtgta tctggcggcg 360 
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gtgttggagt acctgaccgc cgagatcctg gagctggcgg gcaacgccgc ccgcgacaac 420 

aagaagaccc gcatcatccc ccgacacctg cagctggcca tccgcaacga cgaggagcta 480 

aacaagttgc tgggtaaagt cacaattgct cagggcggtg ttctgcccaa catccaggct 540 

gtactgctcc ccaagaagac tgagagtcac cacaaggcca agggcaagta aaacgagaac 600 
5 tcttatattg gctttttaag gaagcagtct taacaaaggc tcttttcaga gccacccatg 



atttttccat ttataagtgg gtgttcttaa cactgaatcg tgaaactaac cgtccatttt 
aaagatgcca taaaaataca taagcaaatt ggcggggcgc ggttgcttac gcctgtaatc 



660 



tattccttaa aagggctcac attttctgtg taaatagcta ttttgcagtt ttctttaata 720 



780 
840 



10 

<210> 19 
<211> 1230 
<212>- DNA 
<213> Homo sapiens 
15 <220> 

<221> GPR12 G protein coupled-receptor 
<222> (1)..(1230) 

<223> GenBank accession No. U18548 
<400> 19 

20 aagcttgtgg catttggtac tggtatctga gcaggggctg gctttctgtt tgtctgtgtg 
ttttttgcat gatcttggat tgtcaccctg ctgtatttaa acattaaaaa gcctgtcttt 
tcgttgaaga ggacaggggt taaaatgaat gaagacctga aggtcaattt aagcgggctg 180 
cctcgggatt atttagatgc cgctgctgcg gagaacatct cggctgctgt ctcctcccgg 
gttcctgccg tagagccaga gcctgagctc gtagtcaacc cctgggacat tgtcttgtgt 

25 acctcgggaa ccctcatctc ctgtgaaaat gccattgtgg tccttatcat cttccacaac 
cccagcctgc gagcacccat gttcctgcta ataggcagcc tggctcttgc agacctgctg 



60 
120 



240 
300 
360 
420 
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gccggcattg gactcatcac caattttgtt tttgcctacc tgcttcagtc agaagccacc 480 

aagctggtca cgatcggcct cattgtcgcc tctttctctg cctctgtctg cagcttgctg 540 

gctatcactg ttgaccgcta cctctcactg tactacgctc tgacgtacca ttcggagagg 600 

acggtcacgt ttacctatgt catgctcgtc atgctctggg ggacctccat ctgcctgggg. 660 

5 ctgctgcccg tcatgggctg gaactgcctc cgagacgagt ccacctgcag cgtggtcaga 720 

ccgctcacca agaacaacgc ggccatcctc tcggtgtcct tcctcttcat gtttgcgctc 780 

atgcttcagc tctacatcca gatctgtaag attgtgatga ggcacgccca tcagatagcc 840 

ctgcagcacc acttcctggc cacgtcgcac tatgtgacca cccggaaagg ggtctccacc 900 

ctggctatca tcctggggac gtttgctgct tgctggatgc ctttcaccct ctattccttg 960 

10 atagcggatt acacctaccc ctccatctat acctacgcca ccctcctgcc cgccacctac 1020 

aattccatca tcaaccctgt catatatgct ttcagaaacc aagagatcca gaaagcgctc 1080 

tgtctcattt gctgcggctg catcccgtcc agtctcgccc agagagcgcg ctcgcccagt 1140 

gatgtgtagc acccttgcac ccaggaggac tctgcattta ccaagcactt ccactgcctg 1200 

gccaaggttt gagatgcttc ccttgaattc 1230 

15 



<210> 20 

<211> 5141 

<212> DNA 

20 <213> Homo sapiens 
<220> 

<221> steroid 21 -hydroxylase (CYP21) 

<222> (1) . . (5141) 

<223> GenBank accession No. M12792 

25 



<400> 20 



agatctttct cccagtagct gctcagcatg 
gggattcagt tgcagcaaga catggcccgg 
gctgtcattg cccaacccca gaaatcccaa 
ggattcagca gcagcgaact cgccaatgta 

5 gctctgcagt agcacagtct ggaaaagctc 
ggtctggcac agagccagaa atggggggga 
caggtaccat ccaggaggcc ctggcctctc 
cctcttccca aatccccaaa ctgcctcctt 
cagtgggacc cccacccctt atggcttcag 

10 gatccagctg tggctatcca gccccttcct 
acttgtgctg ggaatccata ctttccctgg 
agggcagtca tgttctgcct gtgcctggaa 
agaggagaga ggaaeagagc gaggaaaccg 
ccttgacttt ctaagaccag tggggtcccc 

15 aattccccaa ttcttatttt ttatctcaaa 
ccaaagaaag gtgctcctgg tggctaaagg 
gaatcctgcc ctccacaaca tgcgaacaat 
agagttgagt gagtgcccac aaagcactta 
cagcgggaaa ccacttcctc ctttgfccttc 

20 aataacaatc acactgggat acctgtatat 
actgtattac tctatctgta gcttaaacta 
cttcaggcga ttcaggaagg cctattaggc 
atcattccat aaactggccc acggtgggtg 
.cactattgcc tgcacagtga tgtggaacca 

25 cagggttgca tttcccttcc ttgcttcttg 
gtcccaaggc caatgagact ggtgtcattc 



-71- 

gtggtggcat aagcccattt tccggagcca 60 

tctgggaggt caaccatgaa gaaggcagta 120 

tcctgttttc tccctctcag tcctgatcat 180 

gtgggtggca cagccagggt cttgactctg 240 

tgaggggaga gagaccccca ctggtccgag 300 

aggtatgagg ctgggtcgcc tctgacctct 360 

actgaacccg gccactcctc tttggcatgg 420 

acccacaaaa gtggtctctg agtgtcagtc 480 

ttccccaaat agggctggac ccttgatcct 540 

ggggactttg gactttgagg ggggcatgcc 600 

ctggagtaga acctgtggac tgtagtcctg 660 

agacaagaaa cttgactgca gagagaagaa 72 0 

cccgtctccg gggctttttc tgttccctat 780 

tcctctgctt ctttttcctg agttctgtga 840 

ccagctcaag gtgggctgtt ttcctttcaa 900 

tacatattcg acagctagat ttccaggctg 960 

acccgtgttg catatagagc atggctgtga 102 0 

gagcagtgtc tggtacatgc tattactccg 1080 

tgggcacttt tgtgagtgaa aggaggcact 11.40 

actggaatgc cccaggcaaa ccaggcttaa 1200 

acaaacaacc cacacaaatc acattttgtt 12 60 

agggactgcc attttctctc tgagacaaac 1320 

gcagagggag agggcccagg tgggggcgga 13 8 0 

gaaagctgac tctggatgca ggaaaaaggt 1440 

atgggtgatc aatttttttg aaatacggac 1500 

cagaaaaggg ccactctgtg ggcgggtcgg 1560 



-72- 

tgggagggta cctgaaggtg gggtcaaggg aggccccaaa acagtctaca cagcaggagg 1620 

gatggctggg gctcttgagc tataagtggc acctcagggc cctgacgggc gtctcgccat 1680 

gctgctcctg ggcctgctgc tgctgcccct gctggctggc gcccgcctgc tgtggaactg 1740 

gtggaagctc cggagcctcc acctcccgcc tcttgccccg ggcttcttgc acttgctgca 1800 

5 gcccgacctc ccaatctatc tgcttggcct gactcagaaa ttcgggccca tctacaggct 1860 

ccaccttggg ctgcaaggtg agaggctgat ctcgctctgg ccctcaccat aggagggggc 1920 

ggaggtgacg gagagggtcc tctctccgct gacgctgctt tggctgtctc ccagatgtgg 1980 

tggtgctgaa ctccaagagg accattgagg aagccatggt caaaaagtgg gcagactttg 2040 

ctggcagacc tgagccactt acctgtaagg gctgggggca ttttttcttt cttaaacaaa 2100 

10 ttttttttta agagatgggt tcttgctatg ttgcccaggc tggtcttaaa ttcctagtct 2160 

caaatgatcc tcccacctca gcctcaagtg tgagccacct ttggggcatc cccaatccag 2220 

gtccctggaa gctcttgggg ggcatatctg gtggggagaa agcaggggtt ggggaggccg 2280 

aagaaggtca ggccctcagc tgccttcatc agttcccacc ctccagcccc cacctcctcc 2340 

tgcagacaag ctggtgtcta agaactaccc ggacctgtcc ttgggagact actccctgct 2400 

15 ctggaaagcc cacaagaagc tcacccgctc agccctgctg ctgggcatcc gtgactccat - 2460 

ggagccagtg gtggagcagc tgacccagga gttctgtgag gtaaggctgg gctcctgagg 2520 

ccacctcggg tcagcctcgc ctctcacagt agcccccgcc ctgccgctgc acagcggcct 2580 

gctgaactca cactgtttct ccacagcgca tgagagccca gcccggcacc cctgtggcca 2640 

ttgaggagga attctctctc ctcacctgca gcatcatctg ttacctcacc ttcggagaca 2700 

20 agatcaaggt gcctcacagc ccctcaggcc cacccccagc ccctccctga gcctctcctt 27 60 

gtcctgaact gaaagtactc cctccttttc tggcaggacg acaacttaat gcctgcctat 2820 

tacaaatgta tccaggaggt gttaaaaacc tggagccact ggtccatcca aattgtggac 2880 

gtgattccct ttctcagggt gaggacctgg agcctagaca cccctgggtt gtaggggaga 2940 

ggctggggtg gagggagagg ctccttccca cagctgcatt ctcatgcttc ctgccgcagt 3 000 

25 tcttccccaa tccaggtctc cggaggctga agcaggccat agagaagagg gatcacatcg 3060 

tggagatgca gctgaggcag cacaaggtgg ggactgtacg tggacggcct cccctcggcc 312 0 



cacagccagt gatgctaccg gcctcagcat 
agttatgggc ctgttgccac tctgtactcc 
tttcaccctc tgcaggagag cctcgtggca 
ctccaagggg tggcgcagcc gagcatggaa 

5 gtgcacatgg ctgcagtgga cctcctgatc 
tcctgggccg tggttttttt gcttcaccac 
gctccttccc agcaacctgg ccagggcggt 
cggctggggc tgtgcttgcc tcaccggcac 
tggaggctgg gcagctgtgg gctgctgggg 

10 agcagcgact gcaggaggag ctagaccacg 
tcccctacaa ggaccgtgca cggctgccct 
gcctgcggcc cgttgtgccc ttagccttgc 
tcccgagggt tggggatgag tgaggaaagc 
aactccagcc cccttcagca tctccggcta 

15 gaacctccaa ggcgcccacc tggatgagac 
tggtatgtgg gggccggggg cctgccgtca 
ctgaacgcct ccccacccac ctgtccaccc 
gaactccaga gctctggcct tcggctgcgg 
gcgcctggac ctcttcgtgg tgctgacccg 

20 cggggacgcc ctgccctccc tgcagcccct 
gcctttccaa gtgcggctgc agccccgggg 
gtgatggggc aggaccgatg ccagccgggt 
gaacccctcc cctcccccct gtaaacacag 
ctccagcggc tgggtggtga aggaccctgg 

25 ggcagtcata ctggggtgcg agagaggtgg 
cgaaagtttc ttggtctcag cttcatttcc 



-73- 

tgctatgagg cgggttcttt tgcatacccc 3180 

tctccccagg ccagccgctc agcccgctcc 3240 

ggccagtgga gggacatgat ggactacatg 3300 

gagggctctg gacagctcct ggaagggcac 3360 

ggtggcactg agaccacagc aaacaccctc 3420 

cctgaggtgc gtcctgggga caagcaaaag 3480 

gggcaccctc actcagctct gagcactgtg 3540 

tcaggctcac tgggttgctg agggagcggc 3 600 

caggactcca cccgatcatt ccccagattc 3660 

aactgggccc tggtgcctcc agctcccggg 3720 

tgctcaatgc caccatcgcc gaggtgctgc 3780 

cccaccgcac cacacggccc agcaggtgac 3840 

ccgagcccag ggaggtcctg gccagcctct 3900 

cgacatccct gagggcacag tcatcattcc 3960 

ggtctgggag aggccacatg. agttctggcc 402 0 

aaatgtggtg gaggctggtc cccgctgccg 4080 

gcccgcagat cgcttcctgg agccaggcaa 4140 

tgccccggtg tgcctgggcg agccgctggc 4200 

actgctgcag gccttcacgc tgctgccctc 4260 

gccccactgc agtgtcatcc tcaagatgca 4320 

gatgggggcc cacagcccag gccagaacca 4380 

acctcagttt ctcctttatt gctcccgtac 4440 

tgctgcgaga tcgctggcag agaaggcttc 4500 

ctcttctctc ggggcgaccc ctcagtgctc 4560 

gcagcagctc agcctccccc cgctggggag 4620 

gtgaagggca ccgagaactc gaagcccttc 4680 



-74- 

cagtggtacc agctcactcc ctgggaaagg ggttgtcaag agagagtcaa agccggatgt 4740 

cccatctgct cttcccgttc cccttaagga ggtagctccc agcactcaac caacctcccc 4800 

gcagagctcc cttcctgacc ctccgctgca gaggattgag gcttaattct gagctggccc 4860 

tttccagcca ataaatcaac tccagctccc tctgcgaggc tggcatgatt gttccatttc 4920 

5 acccagccgc tcagtccctt gcctgttaca ctgtggggct gaaacctagg caggccgagc 4980 

cccagccacc ccagctctga gccggcctcc ccacccctca cctgatggtc cactgtgctc 5040 

ccgtagagcc cgttgaggtt ggcgtagtgg cagttcctgt accaccaggc ccctcggtag 5100 

gagacagcgc aggagatgag caagctgttg gggtcccgat c 5141 



10 

<210> 21 

<211> 1117 

<212> DNA 

<213> Homo sapiens 
15 <220> 

<221> putative tumor suppressor protein (101F6) 

<222> (1) . . (1117) 

<223> GenBank accession No. AF040704 



20 <400> 21 

tccggaagta aagcggctcc gtgacggagc ggcggtgcgc gcggcagggc ccggagtatc 60 

ccgctttctt tggaggaaac caccgcatca gatctgcgct gcggcagagg caggctacaa 120 

ccactagcac ggctgacgat ggccctttct gcggagaccg agtcacacat ctaccgagct 180 

ctgcgtactg cttctggcgc tgccgcccac cttgtggccc tgggctttac catctttgtg 240 

25 gctgtgcttg ccaggcctgg ctccagcctg ttctcctggc acccggtgct tatgtctttg 300 

gctttctcct tcctgatgac cgaggcacta ctggtgtttt ctcctgagag ttcgctgctg 360 



-75- 

cactccctct cacggaaagg ccgagcacgc tgccactggg tgctgcagct gctggccctg 420 
ctgtgtgcac tgctgggcct cggccttgtc atcctccaca aagagcagct tggcaaagcc 480 
cacctggtta cgcggcatgg gcaggcaggg ctgctggctg tgctgtgggc agggctgcag 540 
tgctcaggtg gggtggggct gctctacccc aagctgctgc cccgatggcc cctggcgaag 600 

5 ctcaagctat accatgctac ttctgggctg gtgggctacc tgctgggtag tgccagcctc 660 
ttgctgggca tgtgctcact ctggttcact gcctctgtca ctggtgcagc ctggtacctg 720 
gctgtattat gccctgtcct caccagcttg gtcattatga accaggtgag caatgcctac 780 
ctataccgca agaggatcca accatgagct cttcccagcc taggggaagc ctggatttgc 840 
ccctccatgt aggagctggg cctagggacc tgttgaactc tctcagctga gtcaggggac 900 

10 acctcaggca ctgggacagt tgggcatttg gaggcccgtg tgtgaattcc tgctcctcat 960 
gctggagtgc ctcccatttc cttccccttt ctctgtcatc ccagaggaac ataggcatca 1020 
tgtgtctgga tgaagctggg gctgcaagac tgcctctcct gcaaggcagc tcatacttgt 1080 
actgtatgtt cagaaatttt aggagagaaa aaagtaa 1117 

15 

<210> 22 
<211> 4396 
<212> DNA 
<213> Homo sapiens 
20 <220> 

<221> transcription factor E2A 
<222> (1) . . (4396) 

<223> GenBank accession No. M31523 



25 <400> 22 

gcctgaggtg cccgccctgg ccccaggaga atgaaccagc cgcagaggat ggcgcctgtg 



60 



ggcacagaca aggagctcag tgacctcctg 
accaacggga agggccggcc cgcctccctg 
gaggaccggc ccagctcagg ctcctggggc 
cccagccgga ccttcagcga gggcacccac 
5 tccacattcc tgggaccggg actcggaggc 
ttcgggagag acgcaggcgt gggcggcctg 
gccctcaaca gccccgggcc cctgtcccct 
ccctcctact ccggcagctc ccggcggaga 
aagaaggtcc ggaaggtccc gccgggtctt 

10 gaggactacg gcagggatgc caccgcctac 
cccgccccct tctacgtggc agatggcagc 
ccgggccagg cgggcttcgg gcccatgctg 
cccggtagcg gcccggtggg cagcagtgga 
cacgagcgta tgggctacca gctgcatgga 

15 tcctccttct cctcagcccc cggagccacg 
gtcagcgggg ccgacagcct cctgggctcc 
gccctcggca aagcactggc ctcgatctac 
tccagccctt ctacccccgt gggctccccc 
cgagcaggag cccccggtgc cttatcgccc 

20 agtaagatag aagaccacct ggacgaggcc 
acagccggcg acatgcacac gctgctgcct 
ggccccatgt cgctgggtgg gcggcacgca 
ggcctcgcag gcagcaccag cctcatgcac 
accctccctg acctgtctcg gcctcccgac 

25 acggcggccg ccagcgagat caagcgggag 
gctgaccact cggaggagga gaagaaggag 



-76- 

gacttcagca tgatgttccc gctgcctgtc 120 

gccggggcgc agttcggagg ttcaggtctt 180 

agcggcgacc agagcagctc ctcctttgac 240 

ttcactgagt cgcacagcag cctctcttca 300 

aagagcggtg agcggggcgc ctatgcctcc 360 

actcaggctg gcttcctgtc aggcgagctg 420 

tcgggcatga aggggacctc ccagtactac 480 

gcggcagacg gcagcctaga cacgcagccc 540 

ccatcctcgg tgtacccacc cagctcaggt 600 

ccgtccgcca agacccccag cagcacctat 660 

ctgcacccct cagccgagct ctggagtccc 720 

ggtgggggct catccccgct gcccctcccg 780 

agcagcagca cgtttggtgg cctgcaccag 840 

gcagaggtga acggtgggct cccatctgca 900 

tacggcggcg tctccagcca cacgccgcct 960 

cgagggacca cagctggcag ctccggggat 1020 

tccccggatc actcaagcaa taacttctcg 1080 

cagggcctgg caggaacgtc acagtggcct 1140 

agctacgacg ggggtctcca cggcctgcag 1200 

atccacgtgc tccgcagcca cgccgtgggc 12 6 0 

ggccacgggg cgctggcctc aggtttcacc 132 0 

ggcctggttg gaggcagcca ccccgaggac 1380 

aaccacgcgg ccctccccag ccagccaggc 1440 

tcctacagtg ggctagggcg agcaggtgcc 1500 

gagaaggagg acgaggagaa cacgtcagcg 1560 

ctgaaggccc cccgggcccg gaccagccca 162 0 



gacgaggacg aggacgacct tctcccccca 
cgggtggcca ataacgcccg ggagcggctg 
gagctggggc gcatgtgcca actgcacctc 
atcctgcacc aggctgtctc ggtcatcctg 

5 ctgaatccca aagcagcctg tttgaaacgg 
ggagaccccc agatggtgct ttcagctccc 
gccgggcaca tgtgaaaggt atgcctccgt 
tctggcccca gaagccggac tcgagacccc 
ctgttgtcag catcgagcca acaccaacct 

10 ggtgacactg ggtccaggag ctccctgggg 
ccctgtcccc gaatctcagc caccgtgtca 
tgcatcttgg ccctgttgcc tgggctgaca 
acctgaaagc aagcaacaaa acatacactt 
aaaaagcgga gaaatggaaa catatcactc 

15 cttctaaagc caccagcaaa ttgtgcctaa 
cattagttac aagatttttt ttttcttaag 
tggtctctgg tttttttaag ctttttttgc 
ataaaagtcc cgtgtctctc gctatttctg 
ttgatcaacc agatgattaa acgttgtatt 

20 ccgtcccccc ccccggaagc cactgcacac 
gtttttttga tgttgggggt tctcttggtt 
ccctgatgaa ggcaccttgg ggcttcectg 
acctgggctt gggggaaggg acgtcagcaa 
caagcagctg tggccccagc ggtaccctac 

25 ctctcccttg gaacctgcac agctctctga 
ctggcatggg gaccacaccc tggagcccgg 



-77- 

gagcagaagg ccgagcggga gaaggagcgc 1680 

cgggtccgtg acatcaacga ggcctttaag 1740 

aacagcgaga agccccagac caaactgctc 18 00 

aacttggagc agcaagtgcg agagcggaac 1860 

cgagaagagg aaaaggtgtc aggtgtggtt 1920 

cacccaggcc tgagcgaagc ccacaacccc 1980 

gggacgagcc acccgctttc agccctgtgc 2040 

gggcttcatc cacatccaca cctcacacac 2100 

gacaaggttc ggagtgatgg gggcggccaa 2160 

ccctggccta ccactcactg gcctcgctcc 2220 

ctctgtgacc tgtcccatgg atcctgaaac 2280 

ggagcatttt ttttttttcc agtaaacaaa 2340 

tgtcagagaa gaaaaaaatg ccttaactat 2400 

aagggggatg ctgtggaaac ctggcttatt 2460 

gcgaaatatt ttttttaagg aaaataaaaa 2520 

gtagatgaaa attagcaagg atgctgcctt 2580 

atatgttttg taagcaacaa atttttttgt 2640 

ctgctgttcc tagactgagc attgcatttc 2700 

aaaaagaccc cgtgtaaacc tgagcccccc 27 6 0 

agacagacgg ggacaggcgg cgggtctttt 2820 

ttgtcatgtg gaaagtgatg cgtgggcgtt 2880 

ccgcatcctc tcccctcagg aaggggactg 2940 

ggtggctctg accctcccag gtgactctgc 3000 

acaacgccct ccccaggccc ccctaagctg 3060 

aatggggcat tttgttggga ccagtgaccc 3120 

tgctggggac ctcctggaca ccctgtcctt 3180 



-78- 

cactccttgc cccagggacc caggctcatg ctctgaactc tggctgagag gagtctgctc 3240 

aggagccagc acaggacacc ccccacccca ccccaccatg tccccattac accagagggc 33 00 

catcgtgacg tagacaggat gccaggggcc tgaccagcct ccccaatgct ggggagcatc 33 60 

cctggcctgg ggccacacct gctgccctcc ctctgtgtgg tccaagggca agagtggctg 3420 

5 gagccggggg actgtgctgg tctgagcccc acgaaggcct tgggctgtgg ctccgaccct 3480 

gctgcagaac cagcagggtg tcccctcggg cccatctgtg tcccatgtcc cagcacccag 3540 

gcctctctcc aggtctcctt ttctggtctt ttgccatgag ggtaaccagc tcttcccagc 3600 

tggctgggac tgtcttgggt ttaaaactgc aagtctccta ccctgggatc ccatccagtt 3660 

ccacacgaac tagggcagtg gtcactgtgg cacccaggtg tgggcctggc tagctggggg 3720 

10 ccttcatgtg cccttcatgc ccctccctgc attgaggcct tgtggacccc tgggctggct 3780 

gtgttcatcc ccgctgcagg tcgggcgtct ccccccgtgc cactcctgag actccaccgt 3840 

tacccccagg agatcctgga ctgcctgact cccctcccca gactggcttg ggagcctggg 3900 

ccccatggta gatgcaaggg aaacctcaag gccagctcaa tgcctggtat ctgcccccag 3960 

tccaggccag gcggagggga ggggctgtcc ggctgcctct cccttctcgg tggcttcccc 4020 

15 tgcgccctgg gagtttgatc tcttaaggga acttgcctct ccctcttgtt ttgctcctgc -'4080 

cctgccccta ggtctgggtg gcagtggccc catagcctct ggaactgtgc gttctgcata 4140 

gaattcaaac gagattcacc cagcgcgagg aggaagaaac agcagttcct gggaaccaca 42 00 

attatggggg gtggggggtg tgatctgagt gcctcaagat ggttttcaaa aaattttttt 4260 

taaagaaaat aattgtatac gtgtcaacac agctggctgg atgattggga ctttaaaacg 432 0 

20 accctctttc aggtggattc agagacctgt cctgtatata acagcactgt agcaataaac 4380 

gtgacatttt ataaag 4396 



<210> 23 
25 <211> 4133 
<212> DNA 



-79- 

<213> Homo sapiens 
<220> 

<221> KIAA0691 
<222> (1) . . (4133) 
5 <223> GenBank accession No. AB014591 



<400> 23 

aaattctccc tgggggtaag gtaccagccc tgtcctttat gggcttcttg ttctaaagca 60 

tatccgtccc atatggttgc tgctagtcac atgtggtgat tagtaactag ttaaaaatga 120 

10 aaaattcagt tcctccatta cacttgccac atttcagatg ttcagtggcc aacagatatg 180 

cgcaaataga gtgtttccag cattgcaaag ttctgttgga tagcactgtt tgccagatgt 240 

tcccttcttt gtgggtgagg actcttttgg tgtgacttcc ctctgtattg aggctcttgt 300 

tcctcagtat ggggctgttt ctgtctttac agtaagtgac tactccaggg ttccctgccc 360 

tgcacacgta gagtgggagc ggcccgtgga tcccagggaa ctgtgctttt cattgtaggc 42 0 

15 cccctccctg gaggggaaga gggcaatctc cgctggtatc tcagaagtct tcttctgagg 480 

cataagcctc tcttcccagg gctcccctgg tctcgctgtc aggccctaag gtatgtcttc 540 

ccttggacta aagctccttg gaactccctt ttgacctcag tcttctctgg gttccaggta 600 

acttccttta aaataaagac gctcctctct tgaagttttg ggttcctgcc ctgatggtct 660 

atgtctccct gactctaaat taccaatcca cttgctatgg gattcctcca tgagtgcaga 720 

20 ttggctccct cacagctgcg gtacctttgc accctcttat cttagtaaga tttctgtctt 780 

ctcccaggtc tctcttgggt actgccttct gcccccaaat ctctaagcct tcttggtatt 840 

agcttctttg ggttaggagt gttatttcct tttggtttaa ggatcctgct ctggaataaa 900 

tgtcttggtg gtttgagtcc cttctacttg gcattcagcc ctgtctgcat gagcgggttc 960 

agctcttcac agctttcggc atctctgctc gccgtcgttt tcccccaccc ccaatctttc 1020 

25 ttctcctacc tacagcttac acacacacac acacacacac acacacacac acgcccttct 1080 

ctgtgagctg ccagtttcat ttgtctcctg acttgtctga gggatgacct ctcctagcca 1140 



cctctgccca gcccctctga gtaggaagtg 
tcatcagctg tgtgcagcat gactgtcctg 
gggaaggtac tccatgctct aggaattttc 
tatgtaactt ttcttgtttc aacaaactta 
5 gggcagtcaa gcgagcatca ggagaactgg 
cttctcttta ccagtcccac ctacctcact 
taccagccag ggaatacgtg ttaattcctc 
agagagtatg aagagagtgc gtctgtaggg 
aaggtgagat tgatcgctgc ctcaagaagg 

10 tttggcagaa gctccacaat gcagccaacg 
taaagaagga gattaagaag ctacaacggc 
ccaacgagat caaggacaag aggcagctta 
tggaacggtt caaagttgtg gaacgagaga 
tgggcctggc ccagaaggta gatcctgccc 

15 tcacgaatac catcgacacg ctcaacatgc 
cactgtcagt gcagacacgc aagaagaagg 
gcttgaagcg gcacatcgag aagcaccgct 
gcatgctgga caatgactcc atcctcgttg 
agtactatgt tgactcatcc caggaccccg 

20 acctggacct cgaggacatt ccacaggcgc 
acatggagga tgagatcttc aaccagtcca 
ctcccatccc gcccagccca gccaactgta 
ggggacgttc cacagacagt gaagtcagcc 
tccacagcaa ccagcaccct cagtccccag 

25 cgcctgctgc ctctgccttg agcaccactc 
cacccccaag tgccctgggc cccaaggcca 
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tgatttccag ggctaatgcc tccatcccag 1200 

ctctgaaaaa cctttt tgag tgtattctgg 1260 

cacttcctga gtcagaggca cacaaaaaag 13 20 

tggggtcccc tgttggccag acactatgct 1380 

ggctggtctc. ttgtcagata gcaaatgctt 1440 

atgctgacta ggtccatgtc tctgggtttt 1500 

tccaatctct cctagcagcg tccgtctcca 1560 

cagggaagat ggcggacaag cgcaaactcc 1620 

tgtccgaggg cgtggagcag tttgaagata 1680 

cgaaccagaa agaaaagtat gaggctgacc 1740 

tgagggacca aatcaagaca tgggtagcgt 1800 

tagacaaccg caagctcatt gagacgcaaa 1860 

ccaaaaccaa agcttacagc aaagagggcc 1920 

agaaggagaa ggaagaggtt ggccagtggc 1980 
aggtggacca gtttgagagt gaagtggagt • 2040 

gcgacaagga taagcaggac cggattgagg 2100 

accacgtgcg catgctagag accatcctgc 2160 

acgccatccg caagatcaag gacgacgttg 2220 

acttcgagga gaacgagttt ctctacgatg 2280 

tggtcgccac ctcccccccc agccacagcc 2340 

gcagcacgcc cacctcaacc acctccagct 2400 

ccacggaaaa ctctgaagat gataagaaga 2460 

agtctccagc caaaaacggc tccaagcctg 2520 

ctgtgccgcc cacctacccc tccggccccc 2580 

ctggcaacaa tggggtcccc gcccccgcag 2 640 

gtccagctcc cagccacaac tcgggcaccc 2700 
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ctgctcccta 


tgcccaggct 


gtggccccac 


cagctcccag 


tgggcccagc 


acgacccay c 


z / ou 




cccggccccc 


cagcgtccag 


cctagcggag 


gcggaggcgg 


cggcagcgga 


ggcggaggga 






gcagcagcag 


tagtaacagc 


agtgccggtg 


gaggggctgg 


caagcagaat 


ggcgc caeca 


9 ft ft n 
z o o u 




gttacagctc 


agttgtggca 


gacagcccgg 


cagaggtggc 


tttgagcagc 


agtgggggca 


9 Q ACi 

c, y *± u 


5 


acaatgccag 


cagccaggcc 


ttgggccccc 


cttccggccc 


ccacaaccca 


cctcccagca 


JUUU 




cctcgaagga 


acccagtgcg 


gcagccccaa 


cgggggctgg 


gggcgtggcc 


ccaggctcag 


3UoU 




ggaacaactc 


agggggaccc 


agcctcctgg 


tgccactgcc 


tgtgaatcct 


cccagctccc 


312U 




caacgcccag 


cttcagtgat 


gccaaggcag ccggtgccct 


gctcaatggg 


cctccacagt 






tcagcaccgc 


cccagaaatc 


aaggcccctg 


agcctctgag 


ctccttgaag 


tecatggegg 


"5 o a r\ 


10 


aacgggcagc 


catcagctct 


ggcattgagg 


accctgtgcc 


aacgctgcac 


ctgaccgagc 


o o f\ r% 
33 00 




gagacatcat 


cctgagcagt 


acatcagcac 


ctccggcctc 


agcccagccg 


cccctgcagc 


3360 




tgtcagaggt 


gaacataccg 


ctgtcgctgg 


gtgtctgtcc 


actgggccct 


gtgcccctca 


3420 




ccaaggagca 


gctctatcag 


caggccatgg aagaggccgc 


ctggcaccac 


atgcctcacc 


3480 




cctctgactc 


tgagcgtatt 


cggcagtacc 


tcccccggaa 


ccc.ctgtccg 


acgcccccct 


3 D4U 


15 


accaccacca 


gatgccaccc 


ccacactcgg 


acactgtgga 


attctaccag 


cgcctgtcga 


*a c c\ c\ 




ccgagacact 


cttcttcatc 


ttctactatc 


tggagggcac 


taaggcacag 


tatctggcag 


o a c n 
ibbu 




ccaaggccct 


aaagaagcag 


tcatggcgat 


tccacaccaa 


gtacatgatg 


tggttccaga 


j / z u 




ggcacgagga 


gcccaagacc 


atcactgacg 


agtttgagca 


gggcacctac 


atctact t tg 


J / O KJ 




actacgagaa 


gtggggccag 


cggaagaagg 


aaggcttcac 


ctttgagtac 


cgctacctgg 




20 


aggaccggga 


cctccagtga 


caccggcccc 


tccctctacc 


cacccccttc 


ccccgcatgc 


jyuu 




tgatccccct 


gcccaggtga 


gggccctgcc 


ctggaagact 


ggagggaggc 


cccaagccac 


3960 




ggggcatccc 


cctctcccag 


gaagcaggga 


gggggccggg 


aggttttcct 


ctcagcccca 


4020 




ccctgggggc 


ccgggggcga 


gggctgcccc 


ctcctcccct 


ccccagtgag 


ggacattttt 


4080 




tggtaaacct 


attttcattt 


tggaaaatat 


ttatgaataa 


atagttttat 


atg 


4133 



25 
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<210> 24 
<211> 1260 
<212> DNA 
<213> Homo sapiens 
5 <220> 

<221> serotonin ID receptor 5-HT1D 
<222> (1) . . (1260) 

<223> Genank accession No. M81590 
<400> 24 

10 ccctctcctt cgtccgctcc atgcccaaga gctgcgctcc ggagctgggg cgaggagagc 60 

catggaggaa ccgggtgctc agtgcgctcc accgccgccc gcgggctccg agacctgggt . 120 
tcctcaagcc aacttatcct ctgctccctc ccaaaactgc agcgccaagg actacattta * 180 

ccaggactcc atctccctac cctggaaagt actgctggtt atgctattgg cgctcatcac 240 

cttggccacc acgctctcca atgcctttgt gattgccaca gtgtaccgga cccggaaact 3 00 

15 gcacaccccg gctaactacc tgatcgcctc tctggcagtc accgacctgc ttgtgtccat 360 

cctggtgatg cccatcagca ccatgtacac tgtcaccggc cgctggacac tgggccaggt 42 0 

ggtctgtgac ttctggctgt cgtcggacat cacttgttgc actgcctcca tcctgcacct 480 

ctgtgtcatc gccctggacc gctactgggc catcacggac gccgtggagt actcagctaa 540 

aaggactccc aagagggcgg cggtcatgat cgcgctggtg tgggtcttct ccatctctat 6 00 

20 ctcgctgccg cccttcttct ggcgtcaggc taaggccgaa gaggaggtgt cggaatgcgt 660 

ggtgaacacc gaccacatcc tctacacggt ctactccacg gtgggtgctt tctacttccc 720 

caccctgctc ctcatcgccc tctatggccg catctacgta gaagcccgct cccggatttt 780 

gaaacagacg cccaacagga ccggcaagcg cttgacccga gcccagctga taaccgactc 840 

ccccgggtcc acgtcctcgg tcacctctat taactcgcgg gttcccgacg tgcccagcga 900 

25 atccggatct cctgtgtatg tgaaccaagt caaagtgcga gtctccgacg ccctgctgga 960 

aaagaagaaa ctcatggccg ctagggagcg caaagccacc aagaccctag ggatcatttt 1020 
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gggagccttt attgtgtgtt. ggctaccctt cttcatcatc tccctagtga tgcctatctg 1080 

caaagatgcc tgctggttcc acctagccat ctttgacttc ttcacatggc tgggctatct 1140 

caactccctc atcaacccca taatctatac catgtccaat gaggacttta aacaagcatt 1200 

ccataaactg atacgtttta agtgcacaag ttgacttgcc gtttgcagtg gggtcgccta 1260 



<210> 25 
<211> 1990 
<212> DNA 
10 <213> Homo sapiens 
<220> 

<221> . coagulation factor XII (Hageman factor) 

<222> . (1) . . (1990) . . 

<223> GenBank accession No. M31315 

15 

<400> 2 5 

ctgctgctcc tggggttcct gctggtgagc ttggagtcaa cactttcgat tccaccttgg 
gaagccccca aggagcataa gtacaaagct gaagagcaca cagtcgttct cactgtcacc 



20 aagggccggc caggccctca gccctggtgt gctaccaccc ccaactttga tcaggaccag 
cgatggggat actgtttgga gcccaagaaa gtgaaagacc actgcagcaa acacagcccc 



cacctcactg gaaaccactg ccagaaagag aagtgctttg agcctcagct tctccggttt 



25 aagggtcctg atgcccactg ccagcggctg gccagccagg cctgccgcac caacccgtgc 
ctccatgggg gtcgctgcct agaggtggag ggccaccgcc tgtgccactg cccggtgggc 



60 
120 



ggggagccct gccacttccc cttccagtac caccggcagc tgtaccacaa atgtacccac 180 



240 
300 



tgccagaaag gagggacctg tgtgaacatg ccaagcggcc cccactgtct ctgtccacaa 3 60 



420 



ttccacaaga atgagatatg gtatagaact gagcaagcag ctgtggccag atgccagtgc 480 



540 
600 



tacaccggac ccttctgcga cgtggacacc 
agctaccgcg gcctggccag gaccacgctc 
gaggccacct accggaacgt gactgccgag 
gccttctgcc ggaacccgga caacgacatc 
5 cggctgagct gggagtactg cgacctggca 
ccgaccccgg tgtcccctag gcttcatgtc 
aagcctcagc ccacgacccg gaccccgcct 
aagcgggagc agccgccttc cctgaccagg 
cgcaagagtc tgtcttcgat gacccgcgtc 

10 cacccctaca tcgccgcgct gtactggggc 
ccctgctggg tgctgacggc cgctcactgc 
acggtggtgc tcggccagga acgccgtaac 
gtgcgctcct accgcttgca cgaggccttc 
ctgttgcgcc ttcaggagga tgcggacggc 

15 ccggtgtgcc tgccaagcgg cgccgcgcga 
ggctggggcc accagttcga gggggcggag 
gtaccgttcc tctccctgga gcgctgctca 
cccggcatgc tctgcgcagg gttcctcgag 
ggaggcccgc tggtgtgtga ggaccaagct 

20 atcagctggg gatcgggctg tggtgaccgc 
tactacctgg cctggatccg ggagcacacc 
cctccttggt gattccgcag tgagagagtg 
attcccccag tgcggccagc tccgcgccag 
aaatgctgag 

25 
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aaggcaagct gctatgatgg ccgcgggctc 660 

tcgggtgcgc cctgtcagcc gtgggcctcg 720 

caagcgcgga actggggact gggcggccac 780 

cgcccgtggt gcttcgtgct gaaccgcgac 840 

cagtgccaga ccccaaccca ggcggcgcct 900 

ccactcatgc ccgcgcagcc ggcaccgccg 960 

cagtcccaga ccccgggagc cttgccggcg 102 0 

aacggcccac tgagctgcgg gcagcggctc 1080 

gttggcgggc tggtggcgct acgcggggcg 1140 

cacagtttct gcgccggcag cctcatcgcc 1200 

ctgcaggacc ggcccgcacc cgaggatctg 1260 

cacagctgtg agccgtgcca gacgttggcc 1320 

tcgcccgtca gctaccagca cgacctggct 1380 

agctgcgcgc tcctgtcgcc ttacgttcag 1440 

ccctccgaga ccacgctctg ccaggtggcc 1500 

gaatatgcca gcttcctgca ggaggcgcag 1560 

gccccggacg tgcacggatc ctccatcctc 1620 

ggcggcaccg atgcgtgcca gggtgattcc 1680 

gcagagcgcc ggctcaccct gcaaggcatc 1740 

aacaagccag gcgtctacac cgatgtggcc 1800 

gtttcctgat tgctcaggga ctcatctttc 1860 

gctggggcat ggaaggcaag attgtgtccc 1920 

gatggcgagg aactcaataa agtgctttga 1980 

1990 
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<210> 26 

<211> 3840 

<212> DNA 

<213> Homo sapiens 
5 <220> 

<221> proto-oncogene tyrosine-protein kinase c-abl 

<222> (1) . . (3840) 

<223> GenBank accession No. M14752 



10 <400> 26 

ggccttcccc ctgcgaggat 
ttgggccggg ctcggcctcg 
gagggggtta aggcgcaggc 
ggccctttgt taacaggcgc 

15 ggcggcgggc ggcggtgagg 
gcctgagccg ggcccggacc 
caaaatgttg gagatctgcc 
gtcctccagc tgttatctgg 
tcagggtctg agtgaagccg 

20 tgaaaatgac cccaaccttt 
tctaagcata actaaaggtg 
gtgtgaagcc caaaccaaaa 
caacagtctg gagaaacact 
tccgctgagc agcgggatca 

25 ccagaggtcc atctcgctga 
ttctgatggc aagctctacg 



cgccgttggc ccgggttggc 
ggaacgccag gggcccctgg 
ggcggcgggg cgggggcggg 
gtcccggcca gcggagacgc 
gcggcctgcg gggcggcgcc 
gagctgggag aggggctccg 
tgaagctggt gggctgcaaa 
aagaagccct tcagcggcca 
ctcgttggaa ctccaaggaa 
tcgttgcacc gtatgatttt 
aaaagctccg ggtcttaggc 
atggccaagg ctgggtccca 
cctggtacca tgggcctgtg 
atggcagctt cttggtgcgt 
gatacgaagg gagggtgtac 
tctcctccga gagccgcttc 



tttggaaagc ggcggtggct 60 

gtgcggacgg gcgcggccag 120 

cctggcgggc gccctctccg 180 

ggccgccctg ggcgggcgcg 2 40 

cgggggccgg gccgagccgg 3 00 

gcccgatcgt tcgcttggcg 360 

tccaagaagg ggctgtcctc 420 

gtagcatctg actttgagcc 480 

aaccttctcg ctggacccag 540 

gtggccagtg gagataacac 600 

tataatcaca atggggaatg 660 

agcaactaca tcacgccagt 720 

tcccgcaatg ccgctgagta 780 

gagagtgaga gcagtcctag 840 

cattacagga tcaacactgc 900 

aacaccctgg ccgagttggt 960 



tea tea teat tcaacggtgg ccgacgggct 
gcgcaacaag cccactgtct atggtgtgtc 
cacggacatc accatgaagc acaagctggg 
cgtgtggaag aaatacagee tgacggtggc 

5 ggtggaagag ttcttgaaag aagctgcagt 
gcagctcctt ggggtctgea cccgggagcc 
etaegggaac ctcctggact acctgaggga 
getgetgtae atggccactc agatctegtc 
catccacaga gatcttgetg cccgaaactg 

10 agctgatttt ggectgagea ggttgatgac 
caagttcccc atcaaatgga ctgcacccga 
gtccgacgtc tgggcatttg gagtattget 
ttacccggga attgacegtt cccaggtgta 
gcgcccagaa ggctgcccag agaaggtcta 

15 tccctctgac cggccctcct ttgctgaaat 
atccagtatc tcagacgaag tggaaaagga 
gactaccttg ctgcaggccc cagagctgcc 
agagcacaga gacaccactg acgtgcctga 
cgatcctctg gaccatgagc ctgccgtgtc 

20 cccggagggc ggcctgaatg aagatgagcg 
gttcagcgcc ttgatcaaga agaagaagaa 
ctccttccgg gagatggacg gccagccgga 
agacatcagc aacggggcac tggctttcac 
cccaaagccc agcaatgggg ctggggtccc 

25 aggcttcegg tctccccacc tgtggaagaa 
caccggcgag gaggagggcg gtggcagctc 
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catcaccacg ctccattatc cagccccaaa 1020 

ccccaactac gacaagtggg agatggaacg 1080 

egggggecag tacggggagg tgtacgaggg 1140 

cgtgaagacc ttgaaggagg acaccatgga 1200 

catgaaagag atcaaacacc ctaacctagt 12 60 

cccgttctat atcatcactg agttcatgac 1320 

gtgeaacegg caggaggtga acgccgtggt 13 80 

agecatggag tacctagaga agaaaaactt 1440 

cctggtaggg gagaaccact tggtgaaggt 1500 

aggggacacc tacacagccc atgctggagc 1560 

gagcctggcc tacaacaagt tctccatcaa 1620 

ttgggaaatt gctacctatg gcatgtcccc 1680 

tgagctgeta gagaaggact acegcatgaa 174 0 

tgaactcatg cgagcatgtt ggcagtggaa 1800 

ccaccaagcc tttgaaacaa tgttccagga 186 0 

gctggggaaa caaggcgtcc gtggggctgt 1920 

caccaagacg aggacctcca ggagagctgc 1980 

gatgcctcac tccaagggcc agggagagag 2040 

tccattgctc cctcgaaaag agegaggtec 2100 

ccttctcccc aaagacaaaa agaccaactt 2160 

gacagcccca acccctccca aacgeagcag 2220 

gcgcagaggg geeggegagg aagagggccg 22 80 

ccccttggac acagctgacc cagccaagtc 2340 

caatggagee etcegggagt ccgggggctc 2400 

gtccagcacg ctgaccagca gccgcctagc 2460 

cagcaagcgc ttcctgcgct cttgctccgt 2520 



I 

•i 

I 
I 
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ctcctgcgtt 


ccccatgggg 


ccaaggacac 


ggagtggagg 


ccagccacgc 


CyCC tcgyya 


6 JOU 




cttgcagtcc 


acgggaagac 


agtttgactc 


gtccacattt 


ggagggcaca 


aaay tyayaa 


O St u 




gccggctctg 


cctcggaaga 


gggcagggga 


gaacaggtct 


gaccaggtga 


CCCyayyLaL 


9700 




agtaacgcct 


ccccccaggc 


tggtgaaaaa 


gaatgaggaa 


gctgctgatg 


ay y uc c tt-cia. 


2760 


5 


agacatcatg 


gagtccagcc 


cgggctccag 


cccgcccaac 


c tgactccaa 


aacccccccy 


282 0 




gcggcaggtc 


accgtggccc 


ctgcctcggg 


cctcccccac 


aaggaagaag 


cc tggaaagg 


OOOA 
ZoOU 




cagtgcctta 


gggacccctg 


ctgcagctga 


gccagtgacc 


cccaccagca 


aagcaggc tc 


z ^ *± u 




aggtgcacca 


aggggcacca 


gcaagggccc 


cgccgaggag 


tccagagtga 


ggaggcacaa 


jUUU 




gcactcctct 


gagtcgccag 


ggagggacaa 


ggggaaattg 


tccaagctca 


aacctgcccc 




10 


gccgccccca 


ccagcagcct 


ctgcagggaa 


ggctggagga 


aagccctcgc 


agaggcccgg 






ccaggaggct 


gccggggagg 


cagtcttggg 


cgcaaagaca 


aaagccacga 


gtctggttga 


t q r\ 
jloU 




tgctgtgaac 


agtgacgctg 


ccaagcccag 


ccagccggca 


gagggcctca 


aaaagcccgt 






gctcccggcc 


actccaaagc 


cacaccccgc 


caagccgtcg 


gggaccccca 


tcagcccagc 






ccccgttccc 


ctttccacgt 


tgccatcagc 


atcctcggcc 


ttggcagggg 


accagccgtc 


JJDU 


15 


ttccactgcc 


ttcatccctc 


tcatatcaac 


ccgagtgtct 


cttcggaaaa 


cccgccagcc 


•5 Aon 

J4ZU 




tccagagcgg 


gccagcggcg 


ccatcaccaa 


gggcgtggtc 


ttggacagca 


ccgaggcgct 


i /i p n 




gtgcctcgcc 


atctctggga 


actccgagca 


gatggccagc 


cacagcgcag 


tgctggaggc 






cggcaaaaac 


ctctacacgt 


tctgcgtgag 


ctatgtggat 


tccatccagc 


aaatgaggaa 


JDUU 




caagtttgcc 


ttccgagagg 


ccatcaacaa 


actggagaat 


aatctccggg 


agcttcagat 


3660 


20 


ctgcccggcg 


tcagcaggca 


gtggtccggc 


ggccactcag 


gacttcagca 


agctcctcag 


3720 




ttcggtgaag 


gaaatcagtg 


acatagtgca 


gaggtagcag 


cagtcagggg 


tcaggtgtca 


3780 




ggcccgtcgg 


agctgcctgc 


agcacatgcg 


ggctcgccca 


tacccatgac 


agtggctgag 


3840 



\ 

1 

25 <210> 27 

<211> 1628 



I 
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<212> DNA 

<213> Homo sapiens 

<220> 

5 <221> AFlq 

<222> (1) . . (1628) 

<223> GenBank accession No. U16954 
<220> 

10 <221> misc_feature 

<222> (114) . . (114) 

<223> undefined nucleotide 



<400> 27 

15 agtcagcacg ggggtgctgg aagagatcgg gaataatagc gcagaccaat gagcctaggg 60 

agatgctttc atcgtctctc cttccctcaa gtgttctgga acctatcatt tganttagcc 120 

gagtcaggca ggagggggcg gggaatcctt ccgcccttct taggaggggc tgcattgcag 180 

ggggagagtg aactgacaga ctcagtcact gaagagggaa aaggagtgag aagacaaagc 240 

cgtcaaagcc ccaacagctt tgtatttctc cagcccggcc ggcagacccc ggagctcccg 300 

20 aggcactccc tccatctttg gaacgcgcca gtaattgaat tgataacagg aagctatgag 360 

ggaccctgtg agtagccagt acagttcctt tcttttctgg aggatgccca tcccagaact 420 

ggatctgtcg gagctggaag gcctgggtct gtcagataca gccacctaca aggtcaaaga 480 

cagcagcgtt ggcaaaatga tcgggcaagc aactgcagca gaccaggaga aaaaccctga 540 

aggtgatggc ctccttgagt acagcacctt caacttctgg agagctccea ttgccagcat 600 

25 ccactccttc gaactggact tgctctaagg ccaagacttc tctctcccat caccttgccc 660 

tcattgtctt ccctctcaag ccccttcctt tccactcctt tcccatttta atcttgttct 720 

ctccctactg tgttggtggt gctgatgaat ctgccagagt tgagttctat gtatttattt 780 
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atctatctgt 


ctactccatt 


tctctcaaaa gccctcaagt 


cacaaag uaa 


-z» f rrrrt" t* raao 


840 


caatggagta 


ctgggtcaca 


crcrcrat tec tc 


ctttcccccc 


caaatatcaa 


CtCCayaaa^ 


900 


taggcctgac 


tggggacacc 


o ty cty »y ^-^y 


tatagtagtg 
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<211>. 993 

20 <212> DNA 

<213> Homo sapiens 
<220> 

<221> guanidinoacetate N-methyl transferase 

<222> (1)..(993) 

25 <223> GenBank accession No. Z49878 



-90- 

<400> 28 

cggcggcgcg cgatcgaggt cgggtcgccg tccagcctgc agcatgagcg cccccagcgc 60 
gacccccatc ttcgcgcccg gcgagaactg cagccccgcg tggggggcgg cgcccgcggc 120 
ctacgacgca gcggacacgc acctgcgcat cctgggcaag ccggtgatgg agcgctggga 180 
5 gaccccctat atgcacgcgc tggccgccgc cgcctcctcc aaagggggcc gggtcctgga 240 
ggtgggcttt ggcatggcca tcgcagcgtc aaaggtgcag gaggcgccca ttgatgagca 300 
ttggatcatc gagtgcaatg acggcgtctt ccagcggctc cgggactggg ccccacggca 3 60 
gacacacaag gtcatcccct tgaaaggcct gtgggaggat gtggcaccca ccctgcctga 420 
cggtcactct gatgggatcc tgtacgacac gtacccactc tcggaggaga cctggcacac 480 

10 acaccagttc aacttcatca agaaccacgc ctttcgcctg ctgaagccgg ggggcgtcct 540 
cacctactgc aacctcacct cctgggggga gctgatgaag tccaagtact cagacatcac 600 
catcatgttt gaggagacgc aggtgcccgc gctgctggag gccggcttcc ggagggagaa 660 
catccgtacg gaggtgatgg cgctggtccc accggccgac tgccgctact acgccttccc 720 
acagatgatc acgcccctgg tgaccaaagg ctgagccccc accccggccc ggccacaccc 780 

15 atgccctccg ccgtgccttc ctggccggga gtccagggtg tcgcaccagc cctgggctga 840 
tcccagctgt gtgtcaccag aagctttccc ggcttctctg tgaggggtcc caccagccca 900 
gggctgatcc cagctgtgtg tcaccagcag ctttcccagc ttgctctgtg agggtcactg 960 
ctgcccactg cagggtgccc tgaggtgaag ccg 993 
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<221> interferon-induced nuclear phosphoprotein 
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<222> (1)..(835) 

<223> Genbank accession No. as of 09 Dec 2002: L22342 
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<400> 29 

5 atggcgagca gcggagtcaa gaacacacca cgatggcgga gaaaagcccc tcatgggagg 
gaaaggaaag agaaaggaaa gaaaagaaaa agatgtatct ggtcaactcc aaaaaggaga 
cataagaaaa aaagcctccc aagagagatc attgatggca cttcagaaat gaatgaagga 
aagaggtccc agaagatgcc tagtacacca cgaagggtca cacaaggggc agcctcacct 
gggcatggca tccaagagaa gctccaagtg gtggataagg tgactcaaag gaaagacgac 
10 tcaacctgga actcagaggt catgatgagg gtccaaaagg caagaactaa atgtgcccga 
aagtccagat cgaaagaaaa gaaaaaggag aaagatatct gttcaagctc aaaaaggaga 
tttcagaaaa atattcaccg aagaggaaaa cccaaaagtg acactgtgga ttttcactgt 
tctaagtccc ccgtgacctg tggtgaggcg aaagggattt tatataagaa gaaaatgaaa 
cacggatcct cagtgaagtg cattcggaat gaggatggaa cttggttaac accaaatgaa 
15 tttgaagtcg aaggaaaagg aaggaacgca aagaactgga aacggaatat acgttgtgaa 
ggaatgaccc taggagagct gctgaagagt ggacttttgc tctgtcctcc aagaataaat 
ctcaagagag agttaaatag caagtgaatt tctactaccc tctcagtcac catgttgcag 
actttccctg tctggaggct caccttagag cttctgagtt tccaagcccg gaatt 
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